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(57) Abstract: The present invention* relates ' 
to the significant functional role of, several ' 
C. elegans genes and of their corresponding 
gene products in cell division and proliferation 
processes that could be identified by means 
of RNA- mediated interference (RNAi) and to 
the identification and isolation of functional 
orthologues of said genes including all 
biologically-active derivatives thereof. The 
invention further relates to the use of said gene 
products (including said orthologues) in the 
development or isolation of antiproliferative 
agents, particularly their use in appropriate 
screening assays, and their use for diagnosis 
and treatment of proliferative diseases. 
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Eukaryotic cell division genes and their use in diagnosis and treatment of 

proliferative diseases 

5 ~~ ~~ ~ ~~ _ ~ 

In a first aspect, the present invention is related to the significant functional role of several 
C. elegans genes and of their corresponding gene products in cell division and proliferation 
processes that could be identified by means of RNA-mediated interference (RNAi). 

In a second aspect, the invention relates to the identification and isolation of functional 
10 orthologues of said genes and their gene products found in other eukaryotic species, in 
particular man, including all biologically-active derivatives thereof. 

In a third aspect, the present invention includes the use of said genes and gene products 
(including said orthologues) in the development or isolation of antiproliferative agents for 
instance their use in appropriate screening assays and in methods for diagnosis and 
1 5 treatment of proliferative diseases. 

In a forth aspect, the invention relates to antibodies to said gene products and their use in 
the development or isolation of antiproliferative agents and in methods for diagnosis and 
treatment of proliferative diseases. 

In a fifth aspect, the present invention is related to the use of these genes and gene products 
20 for developing structural models or other models for evaluating drug binding and efficacy 
as well as to any other uses which are derived from the new functions described here and 
which will become apparent from the disclosure of the present application for any person 
skilled in the art. 



25 Metazoan cell division consists of an extremely complex, highly regulated set of cellular 
processes which must be tightly co-ordinated, perfectly timed, and closely monitored in 
order to ensure the correct delivery of cellular materials to daughter cells. Defects in these 
processes are known to cause a wide range of so-called proliferative diseases, including all 
forms of cancer. Since cell division represents one of the few, if not the only cellular 

30 process that is common to the aetiology of all forais of cancer, its specific inhibition has 
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long been recognised as a preferred site of therapeutic intervention. Although mitotic 
inhibitor drugs are recognised as one of the most promising classes of chemotherapeutic 
agent, screening attempts to find new drug candidates in this class have been undermined 
by the strong inherent tendency of such screens to identify agents that target a single 
5 protein, tubulin. Tubulin polymerises to form microtubules, the primary cytoskeletal 
elements needed for mitotic spindle function and chromosome segregation. MicrotubjiLe 
functions, however, are ubiquitously needed in almost all cell types, whether dividing or 
not, a fact which therefore explains many of the unwanted side effects caused by anti- 
tubulin drugs. 

10 

Perhaps the best known example of a highly successful antineoplastic drug that targets 
tubulin is provided by paclitaxel, and its marketed derivative, Taxol, from Bristol Meyers 
Squibb. Its applicability has indeed been seriously limited by difficulties in determining an 
adequate dosing regimen due to a range of problematic side effects. Taxol treatment has 

15 resulted in anaphylaxis and severe hypersensitivity reactions characterised by dyspnea and 
hypotension requiring treatment, angioedema, and generalised urticaria in 2-4% of patients 
in clinical trials. All Taxol is administered after pretreatment with corticosteroids and 
despite pretreatment, fatal reactions have occurred. Severe conductance abnormalities 
resulting in life-threatening cardiac arrhythmia occur in less than 1 percent of patients and 

20 must be treated by insertion of a pacemaker. Taxol can cause fetal harm or fetal death in 
pregnant women. Furthermore, administration is commonly accompanied by tachycardia, 
hypotension, flushing, skin reactions and shortness-of-breath (mild dypsnea). 

Despite these shortcomings, Taxol has been hailed by many as the most successful new 
25 anti-cancer therapeutic of the last three decades. Clearly, there is good justification for 
attempting to add to the list of mitotic inhibitors used to treat cancer. However, additional 
drugs that target tubulin or interfere with microtubule dynamics may be expected to have 
similar applicability and limitations as Taxol. 



f 
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The task of the present invention therefore is to find new potential target proteins/genes for 
therapeutical drugs other than tubulin that are essential for completion of mitosis. These 
proteins/genes may provide novel targets to screen for new antineoplastic or cytotoxic 
anti-cancer agents. 

5 

Unfortunately, until now, the systematic identification of such target proteins/genes using 
genetic screening methods has been difficult in metazoans, and has relied heavily on the 
use of the unicellular yeast. Several major advances in the use of certain metazoan model 
organisms, particularly the nematode worm Caenorhabditis elegans, have now begun to 
1 0 offer new ways of bridging this gap. 



The above-mentioned task of the invention to find new potential target proteins/genes for 
therapeutical drugs other than tubulin involved in mitosis processes is solved by a 
screening assay in C. elegans based on 'genomic RNA mediated interference (RNAi)' 

15 combined with a highly probative microscopic assay for documenting the first rounds of 
embryonic cell division (Sulston et al, The embryonic cell lineage of the nematode 
Caenorhabditis elegans. Dev. Biol. 100, 64-119 (1983); Gonczy et al, Dissection of cell 
division processes in the one cell stage Caenorhabditis elegans embryo by mutational 
analysis. J Cell Biol 144, 927-946 (1999)). With this combination of techniques a selected 

20 gene and also a variety of selected genes can be functionally characterized with 
unprecedented speed and efficiency. 



The nematode C. elegans exhibits an almost entirely translucent body throughout its 
development, thereby offering unparalleled microscopic access for exquisitely detailed 

25 cytological documentation, even for the earliest steps of embryogenesis. This important 
feature, along with its short life cycle (3-5 days), its ease of cultivation, and its low 
maintenance costs, has helped make C. elegans arguably the best studied of all metazoans. 
Also, sequence data are now available for over 97% of the C. elegans genome (C elegans 
Sequencing Consortium. Genome sequence of the nematode C elegans: a platform for 

30 investigating biology. Science 282, 2012-2018 (1998)). Thus, C elegans has proven to be 
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an ideal organism for applying the new technique of RNA-mediated interference (RNAi). 
This technique consists in the targeted, sequence-specific inhibition of gene expression, as 
mediated by the introduction into an adult worm of double-stranded RNA (dsRNA) 
molecules corresponding to portions of the coding sequences of interest (Fire et aL, Potent 
5 and specific genetic interference by double-stranded RNA in Caenorhabditis elegans. 
Nature 391, 806-81 1 (1998)). For the vast majority of C. elegans genes tested to date, this 
has been shown to yield a sequence-specific inhibition of the targeted gene's expression, 
accompanied by clearly detectable loss of function phenotypes in the treated worm's Fl 
progeny (and even in some cases, in the treated worm itself). 

10 

A large-scale RNAi technique-based screen was performed for 2,232 (that means 96%) of 
the predicted open reading frames on chromosome III of C, elegans which is described in 
detail in Gonczy et al. s "Functional genomic analysis of cell division in C. elegans using 
RNAi of genes on chromosome IIP Nature 408, 331-336 (2000). For the performance of 
15 this large-scale screen double-stranded RNA corresponding to the individual open reading 
frames was produced and micro-injected into adult C. elegans hermaphrodites, and the 
resulting embryos were analysed 24 hours later using time-lapse DIC microscopy. 

Besides others, the C. elegans genes H38K22.2 (Genbank/EMBL ID: AL024499, provided 
in SEQ ID NO. 1 - 3), C02F5.1 (Genbank/EMBL ID: L14745; , provided in SEQ ID NO. 4 
20 and 5) and F10E9.8 (GenBank/EMBL ID: L10986; provided in SEQ ID NO. 6 and 7) gave 
rise to a phenotype detectable by the DIC-assay implying a functional role of these genes 
in metazoan cell division processes. 

In at least one case ( for H38K22.2) it had also been possible to identify a structurally and 
functionally homologous gene, a so-called orthologous gene, in another species, in 
25 particular Homo sapiens, namely the human orthologue RP42. 

For the mouse orthologue of the RP42 gene it had merely been known that the gene shows 
a strongly developmental^ regulated expression, particularly in proliferating neuroblasts 
from which neocortical neurons originate (Mas et al., "Cloning and expression of a novel 
gene, RP42, mapping to an autism susceptibility locus on 6Q16" Genomics 1; 65 (1), 70- 
30 74 (2000)). The functional role of RP42 in cell division and proliferation processes that 
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makes it an excellent tool for the development or identification of drugs for diagnosis and/ 
or therapy of proliferative diseases was not known so far. 

With the essential function of said genes in cell division and proliferation known, these 
5 newly identified target genes and their corresponding gene products, any homologues, 
orthologues and derivatives thereof represent excellent tools for use in the development 
and isolation of a wide range of therapeutics including anti-proliferative agents and in the 
development of methods for diagnosis and treatment of proliferative diseases. 

10 Therefore, in a first aspect, the present invention relates to isolated nucleic acid molecules 
encoding a polypeptide functionally involved in cell division and proliferation or a 
fragment thereof and comprising a nucleic acid sequence selected from the group 
consisting of: 

(a) the nucleic acid sequences presented in SEQ ID NO. 1 to 3, SEQ ID NO. 4 to 5, 
15 SEQ ID NO. 6 to 7, SEQ ID NO. 12 and fragments thereof and their 

complementary strands, 

(b) nucleic acid sequences encoding polypeptides that exhibit a sequence identity 
with SEQ ID NO. 8, SEQ ID NO. 9, SEQ ID NO. 10, SEQ ID NO. 1 1 or SEQ 
ID NO. 13 of at least 25 % over 100 residues and/or which are detectable in a 

20 computer aided search using the blast sequence analysis programs with an e- 

value of at most 10"* 30 , 

(c) nucleic acid sequences which are capable of hybridizing with the nucleic acid 
sequences of (a) or (b) under conditions of medium stringency, 

(d) nucleic acid sequences which are degenerate as a result of the genetic code to 
25 any of the sequences defined in (a), (b) or (c). 
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The above mentioned fragments of the isolated nucleic acid molecules may comprise a at 
least 15 nucleotides and preferably at least 20 nucleotides. 

Additionally the above mentioned isolated nucleic acid molecules may be single or double- 
stranded DNA-molecules as well as single- or double-stranded RNA-molecules. 

5 

a) : 

The nucleic acid sequences of those nucleic acid molecules encoding a polypeptide 
functionally involved in cell division and proliferation as mentioned in a) are provided in 
the sequence listing 

10 as SEQ ID NO. 1 - 3 (C. elegans genes H38K22.2 (Genbank/EMBL ID: AL024499)), 

as SEQ ID NO. 4 and 5 (C elegans gene C02F5.1 (Genbank/EMBL ID: L14745)), 

as SEQ ID NO. 6 and 7 (G elegans gene F10E9.8 (GenBank/EMBL ID: L10986)) and 

as SEQ ID NO. 12 (the human H38K22.2 orthologue, the RP42 protein (NCBI Accession 
No. AF292100). 

15 The corresponding deduced amino acid sequences of these target genes are disclosed in 
SEQ ID NO. 8 (for H38K22.2a), in SEQ ID NO. 9 (for H38K22.2b), in SEQ ID NO. 10 
(for C02F5.1), in SEQ ID NO. 1 1 (for F10E9.8) and in SEQ ID NO. 13 (for RP42). 

b) : 

20 Additionally, the present invention also comprises isolated nucleic acid molecules that are 
structurally and functionally homologous counterparts (particularly orthologues) of at least 
one of said target genes as disclosed in SEQ ID NO 1 to 7 or 12. 

Those homologous nucleic acid molecules may encode polypeptides that exhibit a 
sequence identity with SEQ ID NO. 8, SEQ ID NO. 9, SEQ ID NO. 10, SEQ ID NO. 1 1 or 
25 SEQ ID NO. 13 of at least 25 % over 100 residues, preferably of at least 30 % over 100 
residues, more preferably of at least 35 % over 100 residues and most preferably at least 40 
%over 100 residues. 
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Fig. 5 shows hat the aforementioned sequence identities are signifcant homologies that are 
appropriate to identify a polypeptide as an orthologue of the target proteins as depicted in 
SEQ ID NO. 8 -11, and 13. Fig. 5 shows a multiple sequence alignment of the H38K22.2a 
family on protein level generated with a BLAST sequence analysis program. In this 
5 alignment the two C. elegans splice variants H38K22.2a and H38K22.b are compared to 
their corresponding orthologues in Drosophila (CG7427), in mouse (AAF04863) and in 
Homo sapiens (AAH09478). The statistics in Fig 5 for the alignments show that the 
sequence identity on protein level between the C. elegans clone H38K22.2a and its human 
orthologue (AAH09478) is 36 % over 299 residues. Similarly, the sequence identities 
10 between C. elegans clone H38K22.2b (the other splice variant) and its human orthologue is 
36 % over 238 residues. It is obvious to anyone skilled in the art that these sequence 
homologies are significant homologies and that therefore the human clone with the 
accession No. AAH09478 is unambiguously identified as the human orthologue of the C. 
elegans clones H38K22.2a and H38K22.b. 

15 The invention also comprises isolated nucleic acid molecules that are detectable in a 
computer aided search using one of the BLAST sequence analysis programs with an e- 
value of at most 10~ 30 , preferably with an e-value of at most most 10" 35 3 more preferably 
with an e-value of at most most Iff 40 . 

Fig. 5 shows that the aforementioned e- values characterize signifcant sequence homologies 
20 that are appropriate to identify a polypeptide as an orthologue of the target proteins as 
depicted in SEQ ID NO. 8-11, and 13. 

The BLAST sequence analysis programs are programs used for sequence analysis that are 
publically available and known to anyone skilled in the art. When sequence alignments are 
done by a BLAST sequence analysis program, most of those programs calculate so called 
25 n e-values M to characterize the grade of homology between the compared sequences. 
Generally a small e-value characterizes a high sequence identity / homology, whereas 
larger e-values characterize lower sequence identities / homologies. 

"Homology" means the degree of identity between two known sequences. As stated above, 
homologies, that means sequence identities, may suitably be determined by means of 
30 computer programs known in the art. The degree of homology required for the sequence 
variant will depend upon the intended use of the sequence. It is well within the capability 
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of a person skilled in the art to effect mutational, insertional and deletional mutations 
which are designed to improve the function of the sequence or otherwise provide a 
methodological advantage. 

5 c): 

The present invention further relates to isolated nucleic acid sequences or fragments 
thereof which are capable of hybridizing with the nucleic acid sequences of (a) or (b) under 
conditions of medium/high stringency. 

The grade of sequence identity between a first and a second nucleic acid molecule can also 
10 be characterized by the capability of the first nucleic acid molecule to hybridize under 
certain conditions to a second nucleic acid molecule. 

Suitable experimental conditions for determining whether a given DNA or RNA sequence 
"hybridizes" to a specified polynucleotide or oligonucleotide probe involve presoaking of 
the filter containing the DNA or RNA to examine for hybridization in 5 x SSC (sodium 

15 chloride/sodium citrate) buffer for 10 minutes, and prehybridization of the filter in a 
solution of 5 x SSC, 5 x Denhardf s solution, 0,5 % SDS and 100 mg/ml of denaturated 
sonicated salmon sperm DNA (Maniatis et aL,1989), followed by hybridization in the same 
solution containing a concentration of 10 ng/ml of a random primed (Feinberg, A.P. and 
Vogelstein, B. (1983), Anal Biochem. 132:6-13), 32 P-dCTP-labeled (specific activity > 1 x 

20 10 9 cpm/^g) probe for 12 hours at approximately 45°C. The filter is then washed twice for 
30 minutes in 2 x SSC, 0,5% SDS at at least 55°C (low stringency), at least 60°C (medium 
stringency), preferably at least 65°C (medium/high stringency), more preferably at least 
70°C (high stringency) or most preferably at least 75°C (very high stringency). Molecules 
to which the probe hybridizes under the chosen conditions are detected using an x-ray film. 

25 

d): 

The present invention further relates to isolated nucleic acid molecules or fragments 
thereof which are degenerate as a result of the genetic code to any of the sequences defined 
in (a), (b) or (c). 
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The application of automated gene synthesis provides an opportunity for generating 
sequence variants of the naturally occurring genes. It will be appreciated, for example, that 
polynucleotides coding for the same gene products can be generated by substituting 
synonymous codons for those represented in the naturally occurring polynucleotide 
5 sequences as identified herein. Such sequences will be referred to as "degenerate" to the 
naturally occurring sequences. In addition, polynucleotides coding for synthetic variants of 
the corresponding amino acid sequences can be generated which, for example, will result 
in one or more amino acids substitutions, deletions or additions. Also, nucleic acid 
molecules comprising one or more synthetic nucleotide derivatives (including 

10 morpholinos) which provide said nucleotide sequence with a desired feature, e.g. a reactive 
or detectable group, can be prepared. Synthetic derivatives with desirable properties may 
also be included in the corresponding polypeptides. All such derivatives and fragments of 
the above identified genes and gene products showing at least part of the biological activity 
of the naturally occurring sequences or which are still suitable to be used, for example, as 

15 probes for, e.g. identification of homologous genes or gene products, are included within 
the scope of the present invention. 

Having herein provided the nucleotide sequences of various genes functionally involved in 
cell division and proliferation, it will be appreciated that automated techniques of gene 

20 synthesis and/or amplification may be used to isolate said nucleic acid molecules in vitro. 
Because of the length of some coding sequences, application of automated synthesis may 
require staged gene construction, in which regions of the gene up to about 300 nucleotides 
in length are synthesized individually and then ligated in correct succession for final 
assembly. Individually sythesized gene regions can be amplified prior to assembly, using 

25 polymerase chain reaction (PCR) technology. The technique of PCR amplification may 
also be used to directly generate all or part of the final genes/nucleic acid molecules. In this 
case, primers are synthesized which will be able to prime the PCR amplification of the 
final product, either in one piece or in several pieces that may be ligated together. For this 
purpose, either cDNA or genomic DNA may be used as the template for the PCR 

30 amplification. The cDNA template may be derived from commercially available or self- 
constructed cDNA libraries. 
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In a second aspect, the invention relates to nucleic acid probes comprising a nucleic acid 
sequence as previously characterized under (a) to (d) which may be a polynucleotide or an 
oligonucleotide comprising at least 15 nucleotides containing a detectable label. 
These nucleic acid probes may be synthesized by use of DNA synthesizers according to 
5 standard procedures or, preferably for long sequences, by use of PCR technology with a 
selected template sequence and selected primers. In the use of the nucleotide sequences as 
probes, the particular probe may be labeled with any suitable label known to those skilled 
in the art, including radioactive and non-radioactive labels. Typical radioactive labels 
include 32 P, I25 I, 35 S, or the like. A probe labeled with a radioactive isotope can be 

10 constructed from a DNA template by a conventional nick translation reaction using a 
DNase and DNA polymerase. Non-radioactive labels include, for example, ligands such as 
biotin or thyroxin, or various luminescent or fluorescent compounds. The probe may also 
be labeled at both ends with different types of labels, for example with an isotopic label at 
one end and a biotin label at the other end. The labeled probe and sample can then be 

15 combined in a hybridization buffer solution and held at an appropriate temperature until 
annealing occurs. 

The invention also includes an assay kit comprising either an isolated nucleic acid 
molecule as defined above or a fragment thereof or a probe as defined above in a suitable 
20 container. 

Duplex formation and stability depend on substantial complementarity between the two 
strands of a hybrid and a certain degree of mismatch can be tolerated. Therefore, the 
nucleic acid molecules and probes of the present invention may include mutations (both 
single and multiple), deletions, insertions of the above identified sequences, and 
25 combinations thereof, as long as said sequence variants still have substantial sequence 
homology to the original sequence which permits the formation of stable hybrids with the 
target nucleotide sequence of interest. 

The above identified nucleic acid molecules and probes coding for polypeptides 
30 functionally involved in cell division and proliferation or a part thereof will have a wide 
range of useful applications, including their use for identifying homologous, in particular 
orthologous, genes in the same or different species, their use in screening assays for 
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identification of interacting drugs that inhibit, stimulate or effect cell division or 
proliferation, their use for developing computational models, structural models or other 
models for evaluating drug binding and efficacy, and their diagnostic or therapeutic use for 
detection or treatment of diseases associated with anomalous and/or excessive cell division 

5 or proliferation, in particular neoplastic diseases, including both solid tumors and 
hemopoietic cancers, or coronary restenosis. Exemplary neoplastic diseases include 
carcinomas, such as adenocarcinomas and melanomas; mesodermal tumors, such as 
neuroblastomas and retinoblastomas; sarcomas and various leukemias; and lymphomas. Of 
particular interest are tumors of the breast, ovaries, gastrointestinal tract, liver, lung, 

10 thyroid glands, prostrate gland, brain, pancreas, urinary tract, and salivary glands. Still 
more specific, tumors of the breast, ovaries, lung, colon, and lymphomas are contemplated. 



In a third aspect, the present invention relates to the use of the above identified nucleic acid 
15 molecules and probes for diagnostic purposes. This diagnostic use of the above identified 
nucleic acid molecules and probes may include, but is not limited to the quantitative 
detection of the expression of said target genes in biological probes (preferably, but not 
limited to cell extracts, body fluids, etc.), particularly by quantitative hybridization to the 
endogenous nucleic acid molecules comprising the above-characterized nucleic acid 
20 sequences (particularly cDNA, RNA). An annormal and/or excessive expression of said 
target genes involved in cell division may be diagnosed that way. 

In a forth aspect, the present invention relates to the use of the above identified nucleic 
acid molecules, probes or their corresponding polypeptides for therapeutical purposes. 

25 

This therapeutical use of the above identified nucleic acid molecules, probes or their 
corresponding polypeptides may include, but is not limited to the use of said nucleic acid 
molecules and their corresponding polypeptides for direct or indirect inhibition of the 
expression of said target genes and/or for inhibition of the- function of said target genes. 
30 Particularly gene therapy vectors, e.g. viruses, or naked or encapsulated DNA or RNA (e.g. 
an antisense nucleotide sequence) with the above-identified sequences might be suitable 
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for the introduction into the body of a subject suffering from a proliferative disease or from 
a disease affecting cell division for therapeutical purposes. 

A particularly preferred therapeutical use of the above identified nucleic acid molecules or 
5 probes relates to their use in a therapeutical application of the RNAi technique, particularly 
in humans or in human cells. 

Double-stranded RNA oligonucleotides effect silencing of the expression of gene(s) which 
are highly homologous to either of the RNA strands in the duplex. Recent discoveries 
reveal that this effect, called RNA interference (RNAi), that had been originally discovered 

10 in C. elegans, can also be observed in cells, particularly in human cells. Therefore the 
invention further comprises the use of double-stranded RNA oligonucleotides with the 
above identified nucleotide sequences (as stated in a) to d)), preferably with a length of at 
least 15 nucleotides (nt), more preferably with a length of at least 20 nt, for therapeutical 
silencing of the expression of genes involved in cell division or proliferation in cells ot 

15 other species, particularly in human cells. This therapeutical use particularly applies to 
cells of an individual that suffers from a disease associated with anormalous and/or 
excessive cell division or proliferation, particularly a coronary restinosis or a neoplastic 
disease selected from the group consisting of lymphoma, lung cancer, colon cancer, 
ovarian cancer and breast cancer. 

20 

In a fifth aspect, the invention further comprises a nucleic acid construct or a recombinant 
vector having incorporated the nucleic acid molecules as defined in (a) to (d) or a fragment 
thereof. 

"Nucleic acid construct" is defined herein as any nucleic acid molecule, either single- or 
25 double-stranded, in which nucleic acid sequences are combined and juxtaposed in a 
manner which will not occur naturally. The vector may be any vector which can be 
conveniently subjected to recombinant DNA procedures. The choice of the vector will 
usually depend on the host cell into which it is to be introduced. The vector may be an 
extrachromosomal entity, the replication of which is independent of chromosomal 
30 replication, e.g. a plasmid. Alternatively, the vector may be one which, when introduced 
into a host cell, is integrated into the host cell genome and replicated together with the 
chromosome(s) into which it has been integrated. 
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The vector is preferably an expression vector in which the nucleic acid molecule as defined 
in (a) to (d) or a fragment thereof is operably linked to heterologous or homologous control 
sequences. The term "control sequences" is defined herein to include all components 
5 which are necessary or advantageous for expression of the coding nucleic acid sequence. 
Such control sequences include, but are not limited to, a promoter, a ribosome binding site, 
translation initiation and termination signals and, optionally, a repressor gene or various 
activator genes. Control sequences are referred to as "homologous" if they are naturally 
linked to the coding nucleic acid sequence of interest and referred to as "heterologous" if 
10 this is not the case. The term "operably linked" indicates that the sequences are arranged so 
that they function in concert for their intended purpose, i.e. expression of the desired 
protein. 

The promoter may be any DNA sequence which shows transcriptional activity in the host 
15 cell of choice and may be derived from genes encoding proteins either homologous or 
heterologous to the host cell. 

Examples of suitable promoters for directing the transcription in a bacterial host are, e.g., 
the phage Lambda P R or P L promoters, the lac, trp or tac promoters of E. coli, the promoter 
20 of the Bacillus subtilis alkaline protease gene or the Bacillus licheniformis alpha-amylase 
gene. 

Examples of suitable promoters for directing the transcription in mammalian cells are, e.g., 
the SV40 promoter (Subramani et al., Mol Cell Biol 1 (1981), 854-864), the MT-1 
25 (metallothionein gene) promoter (Palmiter et al., Science 222 (1983), 809-814) or the 
adenovirus 2 major late promoter. 

Examples of suitable promoters for use in insect cells are, e.g., the polyhedrin promoter 
(Vasuvedan et al., Febs. Lett 311, (1992), 7-11), the Autographa californica polyhedrosis 
30 basic protein promoter (EP 397 485), or the baculovirus immediate early gene 1 promoter 
(US 5,155,037, US 5,162,222). 
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Examples of suitable promoters for use in yeast cells include promoters from yeast 
glycolytic genes (Hitzeman et al., J. Biol Chem. 255 (1980), 1203-12080; Alber and 
Kawasaki, 1 Mol AppL Gen. 1 (1982), 419-434) and the ADH2-4c promoter (Russell et 
al. Nature 304 (1983), 652-654). 

5 

The coding sequence may, if necessary, be operably linked to a suitable terminator, such as 
the human growth hormone terminator (Palmiter et al, Science 222, 809-814 (1983)), or a 
polyadenylation sequence. Also, to permit secretion of the expressed protein, a signal 
sequence may precede the coding sequence. 

10 

Further, the vector may comprise a DNA sequence enabling the vector to replicate in the 
host cell in question. Examples of such sequences are the origins of replication of the 
plasmids pUC19, pACYC177, pUBHO, pE194, pAMBl and pIJ702. Another example of 
such a sequence (when the host cell is a mammalian cell) is the S V40 origin of replication. 
15 When the host cell is a yeast cell, suitable sequences enabling the vector to replicate are the 
yeast plasmid 2\i replication genes REP 1-3 and origin of replication. 

The vector may also comprise a selectable marker, e.g. a gene coding for a product which 
complements a defect in the host cell, such as the gene coding for dihydrofolate reductase 
20 (DHFR) or a gene which confers resistance to a drug, e.g. ampicillin, kanamycin, 
tetracyclin, chloramphenicol, neomycin or hygromycin. 

A number of vectors suitable for expression in prokaryotic or eukaryotic cells are known in 
the art and several of them are commercially available. Some commercially available 
25 mammalian expression vectors which may be suitable include, but are not limited to, 
pMClneo (Stratagene), pXTl (Stratagene), pSG5 (Stratagene), pcDNAI (Invitrogen), 
EBO-pSV2-neo (ATCC 37593), pBPV-l(8-2) (ATCC 371 10), pSV2-dhfr (ATCC 37146). 

In a sixth aspect, the invention comprises host cells into which the nucleic acid construct or 
30 the recombinant vector is introduced. These host cells may be prokaryotic or eukaryotic, 
including, but not limited to, bacteria, fungal cells, including yeast and filamentous fungi, 
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mammalian cells, including, but not limited to, cell lines of human, bovine, porcine, 
monkey and rodent origin, and insect cells including, but not limited to, drosophila derived 
cell lines. 

5 The selection of an appropriate host cell will be dependent on a number of factors 
recognized by the art. These include, e.g., compatibility with the chosen vector, toxicity of 
the (co)products, ease of recovery of the desired protein or polypeptide, expression 
characteristics, biosafety and costs. 

Examples of suitable prokaryotic cells are gram positive bacteria such as Bacillus subtilis, 
10 Bacillus licheniformis, Bacillus brevis, Streptomyces lividans etc. or gram negative 
bacteria such as E. coli. 

The yeast host cell may be selected from a species of Saccharomyces or 
Schizosaccharomyces, e.g. Saccharomyces cerevisiae. Useful filamentous fungi may be 
selected from a species of Aspergillus, e.g. Aspergillus oryzae or Aspergillus niger. 
15 Cell lines derived from mammalian species which may be suitable and which are 
commercially available include, but are not limited to, COS-1 (ATCC CRL 1650) COS-7 
(ATCC CRL 1651), CHO-K1 (ATCC CCL 61), 3T3 (ATCCL 92), NIH/3T3 (ATCC CRL 
1658), HeLa (ATCCL 2), and MRC-5 (ATCC CCL 171). 

20 The recombinant vector may be introduced into the host cells according to any one of a 
number of techniques including, but not limited to, transformation, transfection, protoplast 
fusion, and electroporation. 

The recombinant host cells are then cultivated in a suitable nutrient medium under 
25 conditions permitting the expression of the protein of interest. The medium used to 
cultivate the cells may be any conventional medium suitable for growing the host cells, 
such as minimal or complex media containing appropriate supplements. Suitable media are 
available from commercial suppliers or may be prepared according to published recipes 
(e.g. in catalogues of the American Type Culture Collection). 

30 
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Identification of the heterologous polypeptide expressing host cell clones may be done by 
several means, including, but not limited to, immunological reactivity with specific 
antibodies. 

5 In a seventh aspect, the invention is related to a method for producing a polypeptide 
functionally involved in cell division and proliferation or a fragment thereof in a host cell 
comprising the steps 

(i) transferring the expression vector with an operably linked nucleic acid 
molecule as defined in (a) to (d) into a suitable host cell, and 

10 (ii) cultivating the host cells of step (i) under conditions which will permit the 

expression of said polypeptide or fragment thereof and 

(iii) optionally, secretion of the expressed polypeptide into the culture medium. 

In an eigth aspect, the invention comprises a polypeptide functionally involved in cell 
15 division and proliferation or a fragment thereof comprising an amino acid sequence 
selected from the group consisting of: 

(a) the amino acid sequences depicted in SEQ ID NO. 8, 9, 10 5 11 and 13 and 
fragments thereof, 

(b) amino acid sequences which exhibit a sequence identity with the sequences of 
20 (a) of at least 25 % over 100 residues, preferably of at least 30 % over 100 

residues, more preferably of at least 35 % over 100 residues and most 
preferably of at least 40 % over a 100 residues and/or which are detectable in a 
computer aided search using the BLAST sequence analysis programs with an e- 
value of at most 10" 30 , preferably with an e-value of at most 10" 35 and most 
25 preferably with an e-value of at most 1 0" 40 , 

(c) amino acid sequences encoded by a nucleic acid molecule that is capable of 
hybridizing with the nucleic acid sequences of (a) or (b) or encoded by a nucleic 
acid molecule that is degenerate as a result of the genetic code to any of the 
sequences as defined in (a) or (b). 
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The heterologous polypeptide may also be a fusion polypeptide in which another 
polypeptide is fused at the N-terminus or the C-terminus of the polypeptide of interest or 
fragment thereof. A fused polypeptide is produced by fusing a nucleic acid sequence (or a 
portion thereof) encoding another polypeptide to a nucleic acid sequence (or a portion 
5 thereof) of the present invention. Techniques for producing fusion polypeptides are known 
in the art and include ligating the coding sequences so that they are in frame and the 
expression of the fusion polypeptide is under control of the same promotor(s) and 
terminator. 

10 Expression of the polypeptides of interest may also be performed using in vitro produced 
synthetic mRNA. Synthetic mRNA can be efficiently translated in various cell-free 
systems, including but not limited to, wheat germ extracts and reticulocyte extracts, as well 
as efficiently translated in cell based systems including, but not limited to, microinjection 
into frog oocytes, preferably Xenopus oocytes. 

15 

In a ninth aspect, the invention involves antibodies against the above identified 
polypeptides and against immunogenic fragments thereof. The term "antibody" as used 
herein includes both polyclonal and monoclonal antibodies, as well as fragments thereof, 
such as Fv, Fab and F(ab) 2 fragments that are capable of binding antigen or hapten. The 

20 present invention also contemplates "humanized" hybrid antibodies wherein amino acid 
sequences of a non-human donor antibody exhibiting a desired antigen-specifity are 
combined with sequences of a human acceptor antibody. The donor sequences will usually 
include at least the antigen-binding amino acid residues of the donor but may comprise 
other structurally and/or functionally relevant amino acid residues of the donor antibody as 

25 well. Such hybrids can be prepared by several methods well known in the art (see e.g. WO 
89/09622; WO 94/1 1509; Couto, Hybridoma 13 (1994), 215-219; Presta, Cancer Research 
57 (1997), 4593-4599). The antibodies of the present invention will have a wide range of 
useful applications, including their use for affinity purification of the corresponding 
immunogenic (poly)peptides, their use for the preparation of anti-idiotypic antibodies, as 

30 well as their use as specific binding agents in various assays, e.g. diagnostic or drug- 
screening assays, or in a method for treatment of diseases associated with anomalous 
and/or excessive cell division or proliferation as exemplified above. Specifically, said 
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antibodies or suitable fragments thereof, particularly in humanized form, may be used as 
therapeutic agents in a method for treating cancer and other diseases associated with 
anomalous and/or excessive cell division or proliferation as exemplified above. Also, 
antibodies may be raised to the most characteristic parts of the above identified 
5 polypeptides and subsequently be used to identify structurally and/or functionally related 
polypeptides from other sources as well as mutations and derivatives of the above 
identified polypeptides. 

To raise antibodies against the polypeptides of the present invention, there may be used as 
an immunogen either the intact polypeptide or an immunogenic fragment thereof, produced 
10 in a suitable host cell as described above or by standard peptide synthesis techniques. 

Polyclonal antibodies are raised by immunizing animals, such as mice, rats, guinea pigs, 
rabbits, goats, sheep, horses etc., with an appropriate concentration of the polypeptide or 
peptide fragment of interest either with or without an immune adjuvant. 

Acceptable immune adjuvants include, but are not limited to, FrewuTs complete adjuvant, 
15 Freund's incomplete adjuvant, alum-precipitate, water-in-oil-emulsion containing 
Coiynebacterium parvum and tRNA. 

In a typical immunization protocol each animal receives between about 0,1 jig and about 
1000 jig of the immunogen at multiple sites either subcutaneously (SC), intraperitoneal^ 
(IP), intradermally or in any combination thereof in an initial immunization. The animals 

20 may or may not receive booster injections following the initial injection. Those animals 
receiving booster injections are generally given an equal amount of the immunogen in 
Freund's incomplete adjuvant by the same route at intervals of about three or four weeks 
until maximal titers are obtained. At about 7-14 days after each booster immunization or 
about weekly after a single immunization, the animals are bled, the serum collected, and 

25 aliquots are stored at about -20°C. 

Monoclonal antibodies which are reactive with the polypeptide or peptide fragment of 
interest are prepared using basically the technique of Kohler and Milstein, Nature 256: 
495-497 (1975). First, animals, e.g. Balb/c mice, are immunized using a protocol similar to 
30 that described above. Lymphocytes from antibody-positive animals, preferably 
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splenocytes, are obtained by removing spleens from immunized animals by standard 
procedures known in the art. Hybridoma cells are produced by mixing the splenocytes with 
an appropriate fusion partner, preferably myeloma cells, under conditions which will allow 
the formation of stable hybridomas. Fusion partners may include, but are not limited to: 
5 mouse myelomas P3/NSl/Ag 4-1; MPC-11; S-194 and Sp 2/0. Fused hybridoma cells are 
selected by growth in a selection medium and are screened for antibody production. 
Positive hybridomas may be grown and injected into, e.g., pristane-primed Balb/c mice for 
ascites production. Ascites fluid is collected about 1-2 weeks after cell transfer and the 
monoclonal antibodies are purified by techniques known in the art. Alternatively, in vitro 
10 production of monoclonal antibodies (mAb) is possible by cultivating the hybridomas in a 
suitable medium, e.g. DMEM with fetal calf serum, and recovering the mAb by 
techniques known in the art. 

Recovered antibody can then be coupled covalently to a detectable label, such as a 
radiolabel, enzyme label, luminescent label, fluorescent label or the like, using linker 
1 5 technology established for this purpose. 

Antibody titers of ascites or hybridoma culture fluids are determined by various serological 
or immunological assays which include, but are not limited to, precipitation, passive 
agglutination, enzyme-linked immunosorbent antibody (ELISA) technique and 
radioimmunoassay techniques. Similar assays may be used to detect the presence of the 
20 above identified polypeptides or fragments thereof in body fluids or tissue and cell 
extracts. 

Assay kits for performing the various assays mentioned in the present application may 
comprise suitable isolated nucleic acid or amino acid sequences of the above identified 
genes or gene products, labelled or unlabelled, and/or specific ligands (e.g. antibodies) 
25 thereto and auxiliary reagents as appropriate and known in the art. The assays may be 
liquid phase assays as well as solid phase assays (i.e. with one or more reagents 
immobilized on a support). 

Unless otherwise specified, the manipulations of nucleic acids and polypeptidesAproteins 
can be performed using standard methods of molecular biology and immunology (see, e.g. 
30 Maniatis et al. (1989), Molecular cloning: A laboratory manual, Cold Spring Harbor Lab., 
Cold Spring Harbor, NY; Ausubel, F.M. et al. (eds.) "Current protocols in Molecular 



WO 02/38805 



PCT/EPOI/13034 



-20« 

Biology". John Wiley and Sons, 1995; Tijssen, P., Practice and Theory of Enzyme 
Immunoassays, Elsevier Press, Amsterdam, Oxford, New York, 1985). 

The invention further includes an assay kit comprising either the polypeptide as defined 
5 above or a fragment thereof or an antibody against said polypeptides as defined above or 
against immunogenic fragments thereof. 

These recombinant polypeptides or fragments thereof as well as antibodies against those 
polypeptides or immunogenic fragments thereof will have a wide range of useful 

10 applications, including their use in screening assays for interacting drugs that inhibit, 
stimulate or effect the cell division or proliferation, their use for developing computational 
models, structural models or other models for evaluating drug binding and efficacy, and 
their use in a method for diagnosis or treatment of diseases associated with anomalous 
and/or excessive cell division or proliferation, in particular neoplastic diseases, including 

15 both solid tumors and hemopoietic cancers, or coronary restenosis. Exemplary neoplastic 
diseases include carcinomas, such as adenocarcinomas and melanomas; mesodermal 
tumors, such as neuroblastomas and retinoblastomas; sarcomas and various leukemias; and 
lymphomas. Of particular interest are tumors of the breast, ovaries, gastrointestinal tract, 
liver, lung, thyroid glands, prostrate gland, brain, pancreas, urinary tract, and salivary 

20 glands. Still more specific, tumors of the breast, ovaries, lung, colon, and lymphomas are 
contemplated. 

Therefore in a tenth aspect, the present invention explicitly includes the use of 
polypeptides as defined above or fragments thereof or of antibodies against said 
25 polypeptides or immunogenic fragments thereof in a screening assay for interacting drugs 
that inhibit, stimulate or effect the cell division or proliferation. 

Such a screening assay for interacting drugs may particularly comprise, but is not limited 
to the following steps: 

30 

1 . recombinant expression of said polypeptide or of an appropriate derivative thereof 
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isolation and optionally purification of the recombinantly expressed polypeptide or 
of its derivative, in particular by affinity chromatography 

optionally labelling of the chemical compounds that are tested to interact with said 
polypeptide or its derivative and/or labelling of the recombinantly expressed 
polypeptide 

immobilization of the recombinantly expressed polypeptide or of its derivative to a 
solid phase 

binding of a potential interaction partner or a variety thereof to the immobilized 
polypeptide or its derivative 
optionally one or more washing steps 

detection and/or quantification of the interaction, in particular by monitoring the 
amount of label remaining associated with the solid phase over background levels. 

Step 1 includes the recombinant expression of the above identified polypeptide or of its 

15 derivative from a suitable expression system, in particular from cell-free translation, 
bacterial expression, or baculusvirus-based expression in insect cells. 
Step 2 comprises the isolation and optionally the subsequent purification of said 
recombinantly expressed polypeptides with appropriate biochemical techniques that are 
familiar to a person skilled in the art. 

20 Alternatively, these screening assays may also include the expression of derivatives of the 
above identified polypeptides which comprises the expression of said polypeptides as a 
fusion protein or as a modified protein, in particular as a GST-fusion protein or as a protein 
bearing a so called "tag"-sequence. These "tags"-sequences consist of short nucleotide 
sequences that are ligated 'in frame' either to the N- or to the C-terminal end of the coding 

25 region of said target gene. One of the most common tags that are used to label 
recombinantly expressed genes is the poly-Histidine-tag which encodes a homopolypeptide 
consisting merely of histidines. In this context the term "polypeptide" does not merely 
comprise polypeptides with the nucleic acid sequences of SEQ ID No. 1 bis 7, their 
naturally occuring homologues, preferably orthologues, more preferably human 

30 orthologues, in particular the RP42 gene (SEQ ID No. 12), but also derivatives of these 
polypeptides, in particular fusion proteins or polypeptides comprising a tag-sequence. 
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These polypeptides, particularly those labelled by an appropriate tag-sequence (for 
instance a His-tag) or by GST, may be purified by standard affinity chromatography 
protocols, in particular by using chromatography resins linked to anti-His-tag-antibodies or 
to anti-GST-antibodies which are both commercially available. Alternatively to the use of 
5 anti-tag- or anti-GST-antibodies or other 'label-specific' antibodies the purification may 
also involve the use of antibodies against said polypeptides. Screening assays that involve 
a purification step of the recombinantly expressed target genes as described above (step 2) 
are preferred embodiments of this aspect of the invention. 

In a third - optional - step the compounds tested for interaction may be labelled by 
10 incorporation of radioactive isotopes or by reaction with luminescent or fluorescent 
compounds. Alternatively or additionally also the recombinantly expressed polypeptide 
may be labelled. 

In a forth step the recombinantly expressed polypeptide is immobilized to a solid phase, 
particularly (but not limited) to a chromatography resin. The coupling to the solid phase is 
15 thereby preferably established by the generation of covalent bonds. 

In a fifth step a candidate chemical compound that might be a potential interaction partner 
of the said recombinant polypeptide or a complex variety thereof (particularly a drug 
library) is brought into contact with the immobilized polypeptide. 

In a sixth - optional - step one or several washing steps may be performed. As a result just 
20 compounds that strongly interact with the immobilized polypeptide remain bound to the 
solid (immobilized) phase. 

In step 7 the interaction between the polypeptide and the specific compound is detected, in 
particular by monitoring the amount of label remaining associated with the solid phase 
over background levels. 

25 

Brief Description of the Drawings 

Fig. 1 shows DIC microscopy images taken from time-lapse recording of the first two 
rounds of embryonic cell division in wild type C elegans. 

Fig. 2 shows DIC microscopy images taken from time-lapse recording of the first two 
30 rounds of embryonic cell division in C. elegans Fl progeny from F0 parent treated 

with ds RNA "300C3" or "340G12" directed against gene H38K22.2. 
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Fig. 3 shows DIC microscopy images taken from time-lapse recording of the first two 
rounds of embryonic cell division in C. elegans Fl progeny from F0 parent treated 
with dsRNA "307C1" directed against gene C02F5.1. 

Fig. 4 shows shows DIC microscopy images taken from time-lapse recording of the first 
5 two rounds of embryonic cell division in C. elegans Fl progeny from F0 parent 

treated with ds RNA "305 A12" directed against gene F10E9.8. 

Fig 5 shows a multiple sequence alignment of the H38K22.2a family. Herein, the amino 
acid sequences of the two C. elegans splice variants H38K22.2a and H38K22.2b 
are compared to the amino acid sequences of their orthologues in Drosophila 
10 (CG7427), in mouse (AAF04863) and in homo sapiens (AAH09478). 

The "statistics" refer to values that characterize the grade of homology between the 
individual sequences, as the e-value, the sequence identities and the conservatively 
changed residues (positives). 



15 

Description of the sequence protocol: 

SEQ ID NO. 1 shows the unspliced DNA sequence common to both isoforms a and 
b of the C. elegans gene H38K22.2 (3 104 bp). 

SEQ ID NO. 2 shows the spliced DNA sequence of the C. elegans gene H38K22.2a 

20 isoform(1011 bp). 

SEQ ID NO. 3 shows the spliced DNA sequence of the C. elegans gene H38K22.2b 

isoform (852 bp). 

SEQ ID NO. 4 shows the unspliced DNA sequence of the C. elegans gene C02F5. 1 
(3308 bp). 

25 SEQ ID NO. Sshows the spliced DNA sequence of the C. elegans gene C02F5.1 

(3033 bp). 

SEQ ID NO. 6 shows the unspliced DNA sequence of the C. elegans gene F 1 0E9.8 
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SEQ ID NO. 7 



SEQIDNO. 8 



SEQ ED NO. 9 



SEQIDNO. 10 



10 SEQIDNO. 11 



SEQIDNO. 12 



SEQIDNO. 13 



15 



(7097 bp). 

shows the spliced DNA sequence of the C. elegans gene F10E9.8 
(3624 bp). 

shows the deduced amino acid sequence of the C. elegans gene 
H38K22.2a isoform (336 aa). 

shows the deduced amino acid sequence of the C. elegans gene 
H38K22.2b isoform (283 aa). 

shows the deduced amino acid sequence of the C. elegans gene 
C02F5.1 (1010 aa). 

shows the deduced amino acid sequence of the C. elegans gene 
F10E9.8 (1207 aa). 

shows the cDNA sequence of a human orthologue of H38K22.2 
(780 bp). 

shows the deduced amino acid sequence of a human orthologue of 
H38K22.2 (260 aa). 



The following examples illustrate the present invention without, however, limiting the 
20 same thereto. 



EXAMPLE 1: Generation of dsRNA molecules for RNAi experiments 

First, oligonucleotide primer pair sequences were selected to amplify portions of the gene 
25 of interest's coding region using standard PCR techniques. Primer pairs were chosen to 
yield PCR products containing at least 500 bases of coding sequence, or a maximum of 
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coding bases for genes smaller than 500 bases. In order to permit the subsequent use of the 
PCR product as a template for in vitro RNA transcription reactions from both DNA 
strands, the T7 polymerase promoter sequence "TAATACGACTCACTATAGG" was 
added to the 5 f end of forward primers, and the T3 polymerase promoter sequence 
5 "AATTAACCCTCACTAAAGG" was added to the 5' end of reverse primers. The 
synthesis of oligonucleotide primers was completed by a commercial supplier (Sigma- 
Genosys, UK or MWG-Biotech, Germany). 

PCR reactions were performed in a volume of 50 |il, with Taq polymerase using 0.8 |iM 
primers and approximately 0.1 [ig of wild-type (N2 strain) genomic DNA template. The 

10 PCR products were EtOH precipitated, washed with 70% EtOH and resuspended in 7.0 juil 
TE. 1.0 \il of the PCR reaction was pipetted into each of two fresh tubes for 5 |il 
transcription reactions using T3 and T7 RNA polymerases. The separate T3 and T7 
transcription reactions were performed according to the manufacturer's instructions 
(Ambion, Megascript kit), each diluted to 50 \i\ with RNase-free water and then combined. 

15 The mixed RNA was purified using RNeasy kits according to the manufacturer's 
instructions (Qiagen), and eluted into a total of 130 ^1 of RNase-free H 2 0. 50 |ul of this 
was mixed with 10 ^1 6X injection buffer (40 mM KP0 4 pH 7.5, 6 mM potassium citrate, 
pH 7.5, 4% PEG 6000). The RNA was annealed by heating at 68°C for 10 min, and at 37°C 
for 30 min. Concentration of the final dsRNAs were measured to be in the range of 0.1-0.3 

20 Hg/^1. The products of the PCR reaction, of the T3 and T7 transcription reactions, as well' 
as the dsRNA species were run on 1% agarose gels to be examined for quality control 
purposes. Success of double stranding was assessed by scoring shift in gel mobility with 
respect to single stranded RNA, when run on non-denaturing gels. 

25 

EXAMPLE 2: Injections of dsRNA and phenotypic assays 

dsRNAs were injected bilaterally into the syncitial portion of both gonads of wild-type (N2 
strain) young adult hermaphrodites, and the animals incubated at 20°C for 24 hrs. 
30 Embryos were then dissected out from the injected animals and analyzed by time-lapse 
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differential interference contrast videomicroscopy for potential defects in cell division 

processes, capturing 1 image every 5 seconds, as previously described (Gonczy et al, 
Dissection of cell division processes in the one cell stage Caenorhabditis elegans embryo 
by mutational analysis. J Cell Biol 144, 927-946 (1999)). For each experiment, embryos 
5 from at least 3 different injected worms were filmed in this manner, from shortly after 
fertilization until the four cell stage. Embryos from 2 additional injected worms were also 
recorded via still images, thus yielding phenotypic documentation for at least 5 injected 
worms in each experiment. 

In some cases, embryos exhibited acute sensitivity to osmotic changes, as evidenced by 
10 their loss of structural integrity during the dissection of the injected animals. In order to 
overcome this limitation, injected animals were not dissected, but rather, anaesthetized for 
10 min in M9 medium containing 0.1% tricaine and 0.01% tetramisole, and mounted intact 
on an agarose pad to observe the Fl embryogenesis in utero (Kirby et al., Dev. Biol. 142, 
203-215 (1990)). The resolution achieved by viewing through the body wall does not equal 
15 that achieved by observing dissected embryos, and only limited phenotypic analysis was 
conducted in these cases. 

Three injected animals were also transferred to a fresh plate 24 hrs after injection of 
dsRNA, and left at 20°C. Two days later, the plate was checked with a stereomicroscope 
(20-40x total magnification) for the presence of Fl larvae (L2's-L4's), as well as their 
20 developmental stage. Two days after that, the plate was inspected again for the presence of 
Fl adults, as well as their overall body morphology and the presence of F2 progeny. 



EXAMPLE 3: Characterization of the C. elegans gene H38K22.2 

25 

Two dsRNAs, n 300C3 M and "340G12", were designed and used to specifically silence the 
expression of the C. elegans gene H38K22.2 by RNAi, thereby testing its functional 
involvement in the first 2 rounds of embryonic cell division in this metazoan species. The 
dsRNAs were synthesized in vitro from PCR-amplified wild type genomic DNA fragments 
30 of the H38K22.2 gene. For the PCR, two sets of primer pairs were used: 



WO 02/38805 



PCT/EP01/13034 



-27- 

"TC AATC AGTATGTCG ACCC " with "GGAAGAAATTGGGGAAACA" as forward 
and reverse primers, respectively, to generate dsRNA "300C3", and 
"ATCGAGCGCCTCTTCAATC" with "TGGTGTCTCCATTTGCTGA" as forward and 
reverse primers, respectively, to generate dsRNA "340G12". The dsRNAs were purified, 
5 and injected into adult hermaphrodite worms. The phenotypic consequences of the RNAi 
treatment were documented 24 hours later in the Fl progeny of injected worms, using 
time-lapse differential interference contrast (DIC) microscopy. Embryo recordings started 
-20 minutes after fertilisation, while the female pronucleus is completing its meiotic 
divisions, until the 4 cell stage, -30 minutes later. 

10 In the Fl progeny of control worms that were either not injected, or injected with irrelevant 
dsRNA, the cellular events of the first two rounds of embryonic cell division were found to 
exhibit very limited variability, as observed by DIC microscopy. All processes that were 
examined and scored for the possibility of phenotypic deviations are listed and illustrated 
in Figure 1. Briefly, the anteroposterior polarity of the embryo is initially determined by 

15 the position of the male pronucleus at the cortex, shortly after entry into the egg (right 
arrow in Fig. la). This is accompanied by a clear, coordinated flow of yolk granules 
through the central portion of the cytoplasm along the embryo's longitudinal axis towards 
the male pronucleus, and a concomitant series of cortical waves or ruffles progressing 
towards the anterior of the embryo (left side in Fig.l). Shortly thereafter, the male and 

20 female pronuclei undergo highly patterned migrations (right and left arrows respectively, 
in Fig. la,b) resulting in their meeting within the posterior half of the embryo (Fig. lc), 
followed by a centration and rotation (Fig. Id) of the pronuclear pair and associated 
centrosomes (arrowheads in Fig. lb-d) to set up the future mitotic spindle along the 
embryo's longitudinal axis. After synchronous breakdown of the pronuclear envelopes, the 

25 clearly bipolar mitotic spindle is initially short (Fig. le), but then elongates while 
exhibiting clear lateral "rocking" movements of the posterior pole (Fig. lf-h). These 
movements are accompanied by a slight posterior displacement of the posterior spindle 
pole, while the anterior spindle pole remains approximately stationary. This then results in 
an asymmetric positioning of the spindle during anaphase and telophase, thereby yielding 

30 an asymmetric placement of the cytokinetic furrow (arrowheads in Fig. lij), and 
generating unequally-sized daughter cells: a smaller posterior PI blastomere (right cell in 
Fig. lk-o), and larger anterior AB blastomere (left cell in Fig. lk-n). While the AB nucleus 
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then migrates directly to the center of the AB cell (left arrow in Fig. lk-l) 5 the PI nucleus 
typically migrates further towards the posterior of that cell (right arrow in Fig. lk-1), before 
undergoing a pronounced 90° rotation while re-migrating to the anterior PI cortex with one 
of its duplicated centrosomes leading (arrowheads in Fig. lm). This insures that the PI 
5 blastomere then divides along the embryo's longitudinal axis, perpendicular to that of the 
AB blastomere (Fig. In, arrowheads indicate centrosomes). These two divisions occur 
asynchronously, with PI lagging 2-3 minutes behind AB (Fig. 1 n-p). 

In the Fl embryos of worms injected with dsRNAs "300C3" or "340G12", the following 
highly reproducible phenotypes are observed (Fig. 2). First, although the dynamics of 

10 female pronuclear migration appear normal in all cases, its initiation is often somewhat 
delayed. Meeting and apposition of the two pronuclei also typically exhibits defects in that 
the female pronucleus gets captured by only one of the two centrosomes associated with 
the male pronucleus (compare Fig. 2a-c with Fig. la-c). Although this defect is usually 
corrected before pronuclear envelope breakdown is completed, subsequent positioning of 

15 the mitotic spindle within the embryo often appears defective. Weak manifestation of this 
phenotype appears as a lack of rocking of the posterior spindle pole during anaphase, while 
more severe cases show a notable drift of the entire spindle towards the posterior or lateral 
cortex, reaching the cortex itself and losing its longitudinal alignment completely. In the 
latter cases, the strongly aberrant spindle position gives rise to inappropriate specification 

20 of cleavage furrow formation, leading to anomalous cytokinesis. Even in cases where 
spindle position appears relatively normal, positioning of the daughter Nucleus- 
Centrosomes-Complexes (NCCs) typically appears abnormal as soon as anaphase ends and 
the cleavage furrow ingresses. This is often particularly visible in the AB blastomere, 
where the NCC, instead of moving directly to the centre of the cell starting at telophase, 

25 first migrates anteriorly in close proximity to the lateral cortex before eventually centering 
(Fig. 2a-k). This defect is usually accompanied by an apparent absence of interzonal 
spindle microtubules at telophase and a notable bifurcation or forking of the cytokinetic 
cleavage furrow (arrows in Fig. 2 g), leading to aberrantly-sized daughter blastomeres or 
even failure of cytokinesis by complete regression of the furrow (Fig. 2g-m). Nuclear 

30 migration and positioning of the PI nucleus is also aberrant in most cases, resulting in a 
significant delay - or in some cases, a complete failure - in achieving its expected 90° 
rotation and association with the anterior cortex. Division of the PI blastomere is often 



WO 02/38805 



PCT/EP01/13034 



-29- 

significantly delayed in such embryos. Finally, defects in female meiotic divisions are also 
occasionally observed, as evidenced by the presence of multiple female pronuclei, 
indicating a failure to successfully extrude one or both polar bodies, which could come 
from cytokinetic defects similar to those noted above. 

5 All observed phenotypes indicate a requirement for H38K22.2 gene function in the 
microtubule-dependent cellular positioning of NCCs and spindles during mitosis, and 
possibly meiosis. Since this function is essential to cell cycle progression and cell division 
throughout metazoans, this gene and any homologues and derivatives thereof represent 
excellent tools for use in the development of a wide range of therapeutics including anti- 

10 proliferative agents. Analysis of the H38K22.2 gene sequence reveals clear orthologues in 
human (NCBI Accession # AAH09478), mouse (NCBI Accession # AAF04863) and 
Drosophila (NCBI Accession # CG7427) (see Fig. 5), all of which have had no known 
functions ascribed to them until now. Based on their extremely high level of sequence 
conservation at the protein level, it can be concluded that all of these genes most likely 

15 encode proteins with equivalent functions in each of their respective species. The 336 
residue protein encoded by the H38K22.2 gene isoform "a" exhibits no known structural 
motifs or consensus domains, according to either SMART or CDD analyses. 



20 EXAMPLE 4: Characterization of the C elegans gene C02F5.1 

A dsRNA, "307C1", was designed and used to specifically silence the expression of the C. 
elegans gene C02F5.1 by RNAi, thereby testing its functional involvement in the first 2 
rounds of embryonic cell division in this metazoan species. The dsRNA was synthesized in 

25 vitro from a PCR-amplified wild type genomic DNA fragment of the C02F5.1 gene. For 
the PCR, oligonucleotides with sequences "ATCTGAAGATCCGTCCACT" and 
"ATGCACAATGGGTA I I I II " were used as forward and reverse primers, respectively, 
to generate dsRNA M 307C1" which was purified, and injected into adult hermaphrodite 
worms. The phenotypic consequences of the RNAi treatment were documented 24 hours 

30 later in the Fl progeny of injected worms, using time-lapse differential interference 
contrast (DIC) microscopy. Embryo recordings started -20 minutes after fertilisation, 
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while the female pronucleus is completing its meiotic divisions, until the 4 cell stage, -30 
minutes later. 

In the Fl progeny of control worms that were either not injected, or injected with irrelevant 
dsRNA, the cellular events of the first two rounds of embryonic cell division were found to 
5 exhibit very limited variability, as observed by DIC microscopy. All processes that were 
examined and scored for the possibility of phenotypic deviations are listed and illustrated 
in Figure 1. 

Fl embryos from parent worms injected with dsRNA M 307C1" are consistently found to 
10 exhibit the following phenotypes (Fig. 3). First, all cellular processes that are scorable by 
DIC microscopy until entry into mitosis are typically indistinguishable from the wild type 
pattern. These include egg shape and size, yolk granule size and density, yolk granule 
flows and cortical ruffling, pseudo-cleavage furrow formation and positioning, pronuclear 
appearance (arrows in Fig. 3a) and migration (Fig. 3a,b), as well as centration and rotation 
15 of pronuclei (Fig. 3b,c) and associated pair of centrosomes (arrowheads in Fig. 3b,c). 
Formation and positioning of the bipolar mitotic spindle also take place normally, but the 
spindle is most often thinner and less rigid than in wild type, exhibiting aberrant lateral 
bending during its rocking and elongation at anaphase (Fig. 3f-i). After completion of 
cytokinesis, which appears normal, the reforming daughter nuclei are typically tear-shaped, 
20 and remain close to the newly-formed cortex for a prolonged period (Fig. 3a and k). 
Consistent with the tear shape, the two nuclei remain often physically connected by 
anomalous chromatin bridges and karyomeres are also typically seen (asterisks in Fig. 3k 
and 1). This phenotype subsequently results in embryonic lethality in all cases. 

The absence of defects in pronuclear migration and assembly of the bipolar spindle argue 
25 against a role for this gene in more general microtubule functions. The observed defects 
are consistent with a failure in mitotic chromosome segregation, most likely in the 
separation of sister chromatids, resulting in the formation of chromatin bridges, which then 
persist at telophase. The present data therefore indicate an essential requirement for 
C02F5.1 gene function in mitotic chromosome segregation. Since this function is essential 
30 to cell cycle progression and cell division throughout metazoans, this gene and any 
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homologues and derivatives thereof represent excellent tools for use in the development of 
a wide range of therapeutics including antiproliferative agents. 

Analysis of the C02F5.1 sequence reveals that the encoded 1010 residue protein contains 
regions predicted to form coiled coil structures, i.e. likely protein-protein interaction 
5 domains. Sequence homology analyses using the BLASTp program presently reveal no 
clearly orthologous sequences in other organisms. However, considering the essential and 
highly conserved nature of the cellular process in question, functional orthologues of this 
gene/protein are extremely likely to exist in all metazoans, possibly in all eukaryotes, and 
will be identified using for example the methodology as outlined in EXAMPLE 6. 

10 



EXAMPLE 5: Characterization of the C. elegans gene F10E9.8 

15 

Two dsRNAs, "305A12" and "341G5", were designed and used to specifically silence the 
expression of the C elegans gene F10E9.8 by RNAi, thereby testing its functional 
involvement in the first 2 rounds of embryonic cell division in this metazoan species. 
The dsRNAs were synthesized in vitro from PCR-amplified wild type genomic DNA 

20 fragments of the F10E9.8 gene. For PCR, two sets of primer pairs were used: 
"TTCGTCTCGAACACGTATATCCT" with "GAAAGAAGATGAATCAGGCATTG" as 
forward and reverse primers, respectively, to generate dsRNA "305A12", and 
"CTGCAAAAATTATGACTGTGTCG" with "AGCATTCAGATTTGGTTGTCC" as 
forward and reverse primers, respectively, to generate dsRNA "341 G5". The dsRNA was 

25 purified, and injected into adult hermaphrodite worms. The phenotypic consequences of 
the RNAi treatment were documented 24 hours later in the Fl progeny of injected worms, 
using time-lapse differential interference contrast (DIC) microscopy. Embryo recordings 
started -20 minutes after fertilisation, while the female pronucleus is completing its 
meiotic divisions, until the 4 cell stage, -30 minutes later. 

30 In the Fl progeny of control worms that were either not injected, or injected with irrelevant 
dsRNA, the cellular events of the first two rounds of embryonic cell division were found to 
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exhibit very limited variability, as observed by DIC microscopy. All processes that were 
examined and scored for the possibility of phenotypic deviations are listed and illustrated in 
Figure 1. 

In the Fl embryos of worms injected with dsRNAs "305A12" or "341G5", the following 

5 highly reproducible phenotypes are observed (Fig. 4). First, all cellular processes that are 
scorable by DIC microscopy until the 2-cell stage are typically indistinguishable from the 
wild type pattern. These include egg shape and size, yolk granule size and density, yolk 
granule flows and cortical ruffling, pseudo-cleavage furrow formation and positioning, 
pronuclear appearance (arrows in Fig. 4a) and migration (Fig. 4a,b), as well as centration 

10 and rotation of pronuclei (Fig. 4b,c) and associated pair of centrosomes (arrowheads in Fig. 
4b,c). The first round of division also occurs without any detectable deviations from wild 
type (Fig. 4d-h). It should particularly be noted that no defects are observed with respect to 
size, number or positioning of centrosomes or spindle poles in the single cell embryo (note 
arrowheads in Fig. 4b-f). In the two-cell stage embryo, however, although nuclear 

15 positioning also remains equivalent to wild type, an apparent failure in centrosome 
duplication is consistently observed in one of the two blastomeres and sometimes in both. 
A single perinuclear centrosomal region, as seen by its exclusion of yolk granules (black 
arrowhead in Fig. 4h-j), is typically observed instead of the two normally seen both in wild 
type embryos and in the unaffected blastomere (white arrowheads in Fig. 4i j). Despite the 

20 apparent failure in centrosome duplication, microtubule-dependent processes continue 
normally, as illustrated by the successful anterior migration of the PI nucleus, with its 
single centrosomal region leading (black arrowhead in Fig. 4h-j). Upon entering mitosis, as 
scored by nuclear envelope breakdown, the defective blastomere then fails to generate a 
bipolar spindle, forming instead a monopolar array of microtubules (dashed circle in Fig. 

25 4k), as evidenced by the radial alignments of yolk granules in that region. Cytokinesis fails 
to occur in that blastomere, resulting in reformation of multiple, irregularly sized nuclei, 
known as karyomeres (arrows in Fig. 4m,n). In contrast, all aspects of cell division occur 
normally in the neighboring blastomere, resulting in normal daughter cells, each containing 
a single equal-sized nucleus (arrows in Fig. 41). 

30 The complete failure in bipolar spindle formation, accompanied by the presence of a single 
centrosomal region instead of two in the affected two-cell stage blastomere, clearly 
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indicates a requirement for F10E9.8 gene function in the complex process of mitotic 
spindle assembly. However, the lack of detectable defects in other microtubule-dependent 
processes including pronuclear migration and spindle function in the single-cell embryo 
effectively rules out a general microtubule-related function. In view of the maternal nature 
5 of the RNAi effect and the fact that the egg inherits its first centrosome paternally, the 
successful generation of a bipolar spindle in the single-cell embryo further suggests that 
F10E9.8 function may, in fact, be required for some aspect of centrosome duplication or 
separation. 

Indeed, since sperm development is fully completed within the parent before initiation of 
10 the RNAi treatment, it remains unaffected by the injected dsRNA. This results in the 
donation of an intact "wild type" centrosome from the sperm to the egg at fertilisation. 
After fertilisation, this already bipartite centrosome (i.e. containing two "replication units", 
as evidenced by the presence of two centrioles) undergoes one round of duplication, as 
observed in other systems by the budding of a new centriole barrel from each existing 
15 centriole. This is followed by a physical separation of the two centriole pairs and 
associated pericentriolar material. This process is not dependent on the prior duplication 
event, and is solely needed to insure the successful formation of the bipolar spindle to be 
used in the first round of embryonic cell division. It therefore appears that F10E9.8 
function is most likely not required for this process. 

20 5. If the first duplication round fails, however, bipolar spindle formation is expected to fail 
during the second round of division, as seen here. Interestingly, the fact that this failure 
often occurs only in one of the two blastomeres suggests that in these cases only one of the 
original centrosome's two "replication units" actually failed in its first round of duplication 
at the single-cell stage. This observation is consistent with findings from other eukaryotes 

25 indicating that one of the two replication units contained within the sperm's centrosome 
actually comes into the egg already fully equipped for one duplication round, while the 
other must rely on cytoplasmic factors within the egg to permit its own duplication (Sluder, 
G., Hinchcliffe EH. Control of centrosome reproduction: the right number at the right time. 
Biol. Cell. 91, 413-27 (1999). 
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The present findings therefore suggest that the requirement for F10E9.8 function in mitotic 
spindle assembly most likely results from this gene's essential role in the process of 
centrosome duplication. 

5 Since the process of spindle assembly is essential to cell cycle progression and cell division 
throughout metazoans, this gene and any homologues and derivatives thereof represent 
excellent tools for use in the development of a wide range of therapeutics including anti- 
proliferative agents. Analysis of the F10E9.8 sequence reveals that the encoded 1207 
residue protein contains one large region predicted to form coiled coil structures, i.e. likely 

10 protein-protein interaction domains, and four predicted transmembrane domains. Sequence 
homology analyses using the BLASTp program presently reveal no clearly orthologous 
sequences in other organisms. However, considering the essential and highly conserved 
nature of the cellular process in question, functional orthologues of this gene/protein are 
extremely likely to exist in all metazoans, possibly all eukaryotes, and will be identified 

1 5 using for example the following methodology. 



EXAMPLE 6: Protocol for identifying functional orthologues in other species 

20 The present invention describes genes identified as having essential functions in cell 
division in the model organism C. elegans. The basis for performing research in model 
organisms is that the newly discovered functions for the genes in C elegans will be 
conserved in other species including humans. Cell division is highly conserved during 
evolution and therefore the approach of discovering a gene function in C. elegans and 

25 using the information to characterise or assign functions for the human orthologue is well 
justified. There are two themes of conservation of genes during evolution. A gene 
sequence may be conserved. This means that the DNA nucleotide sequence of the gene is 
very similar in different species, which in turn suggests that the function of the gene is the 
same in the different species. As is known to any person skilled in the art, a sequence 

30 identity or homology above a particular level defines that two genes in different species 
code for the same gene product and gene function. Homologous genes are typically 
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identified by performing blast analysis with appropriate software, or by other approaches. 
For a blast search, an e-value of 10" 30 will extract the significant homologous sequences. 
Further phylogenetic analysis can be performed to identify which of the extracted 
sequences are the orthologues. 

5 Therefore the following example for identification of orthologues can be presented. A blast 
search is performed using the blast sequence analysis programs and an e-value of 10" 3 . An 
alternative parameter can be the percentage of sequence identity. Over 100 residues, a 
sequence identity of 30% defines a homologous gene. After the blast search is completed, 
multiple sequence alignment is performed using appropriate software (for example, 
10 CLUSTALW) and a neighbour joining phylogenetic tree is generated. Any person skilled 
in the art can identify the human orthologue from a phylogenetic tree. Essentially, the 
human sequence that is separated on the tree by a single speciation event or most closely 
related on the tree is likely to be an orthologue. 

The second theme of conservation is that the gene function can be conserved with greater 
15 divergence of sequence. In the present invention this theme of conservation is not defined. 
However, if other genes are discovered to have functions that result in the gene product 
being identified as the same gene product as those claimed in the present invention then the 
present claims also apply to such genes. 

20 
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Claims 

1 . An isolated nucleic acid molecule encoding a polypeptide functionally involved in cell 
division and proliferation or a fragment thereof and comprising a nucleic acid 
sequence selected from the group consisting of: 

(a) the nucleic acid sequences presented in SEQ ED NO. 1 to 3, SEQ ID NO. 4 to 5, 
SEQ ID NO. 6 to 7, SEQ ID NO. 12 and fiagments thereof and their 
complementary strands, 

(b) nucleic acid sequences encoding polypeptides that exhibit a sequence identity with 
SEQ ID NO. 8, SEQ ID NO. 9, SEQ ID NO. 10, SEQ ID NO. 1 1 or SEQ ID NO. 
13 of at least 25 % over 100 residues and/or which are detectable in a computer 
aided search using the blast sequence analysis programs with an e-value of at most 

io- 30 , 

(c) nucleic acid sequences which are capable of hybridizing with the nucleic acid 
sequences of (a) or (b) under conditions of medium/high stringency, 

(d) nucleic acid sequences which are degenerate as a result of the genetic code to any 
of the sequences defined in (a), (b) or (c). 

2. A nucleic acid probe comprising a nucleic acid sequence as defined in claim 1 which 
may be a polynucleotide or an oligonucleotide comprising at least 15 nucleotides 
containing a detectable label. 

3. A recombinant vector or nucleic acid construct having incorporated therein the 
isolated nucleic acid molecule of claim 1 or a fragment thereof. 

4. The vector of claim 3 which is an expression vector. 

5. A host cell which has been genetically engineered to incorporate therein the isolated 
nucleic acid molecule of claim 1 or the recombinant vector or nucleic acid construct of 
claim 3. 

6. The host cell of claim 5 having incorporated therein the expression vector of claim 4. 
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7. An assay kit comprising the isolated nucleic acid molecule or a fragment thereof of 
claim 1 or the probe of claim 2 in a suitable container. 

8. A method for producing a polypeptide functionally involved in cell division and. 
proliferation or a fragment thereof in a host cell comprising the steps 

(a) transferring the expression vector of claim 4 into a suitable host cell, and 

(b) cultivating the host cells of step (a) under conditions which will permit the 
expression of said polypeptide or fragment thereof and 

(c) optionally, secretion of the expressed polypeptide into the culture medium. 

9. Use of a probe as defined in claim 2 to isolate orthologues of genes comprising the 
nucleic acid sequences as disclosed in SEQ ID NO. 1 to 3, SEQ ID NO. 4 to 5, SEQ 
ID NO. 6 to 7, SEQ ID NO. 11. 

10. Use of the isolated nucleic acid molecule or a fragment thereof as defined in claim 1 
for producing a polypeptide functionally involved in cell division and proliferation or 
a fragment thereof. 

1 1 . Use of a nucleic acid molecule or a fragment thereof as defined in claim 1 or of the 
probe of claim 2 in a screening assay for interacting drugs that inhibit, stimulate or 
effect the cell division or proliferation. 

12. Use of a nucleic acid molecule as defined in claim 1 or of the probe of claim 2 in a 
method for diagnosis or treatment of diseases associated with anormalous and/or 
excessive cell division or proliferation. 

1 3 . The use of claim 12 wherein the disease is a coronary restenosis or a neoplastic disease 
selected from the group consisting of lymphoma, lung cancer, colon cancer, ovarian 
cancer and breast cancer. 
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14. A polypeptide functionally involved in cell division and proliferation or a fragment 
thereof comprising an amino acid sequence selected from the group consisting of: 

(a) the amino acid sequences depicted in SEQ ID NO. 8, 9, 10, 11 and 13 and 
5 fragments thereof, 

(b) amino acid sequences which exhibit a sequence identity with the sequences of (a) 
of at least 25 % over 100 residues and/or which are detectable in a computer aided 
search using the BLAST sequence analysis programs with an e-value of at most 

io- 30 , 

10 (c) amino acid sequences encoded by any of the nucleic acid sequences (c) - (d) as 

defined in claim 1. 

15. A fusion protein comprising the polypeptide or fragment thereof of claim 14. 



15 16. An antibody or a fragment thereof capable of specifically binding with the polypeptide 
of claim 1 4 or with an immunogenic part thereof. 

17. A humanized antibody capable of specifically binding with the polypeptide of claim 
14 or with an immunogenic part thereof 

20 

18. An assay kit comprising the polypeptide as claimed in claim 14, the fusion protein as 
claimed in claim 15, or the antibodies as claimed in claims 16 and/or 17 in a suitable 
container. 

19. Use of the polypeptide of claim 14, of the fusion protein of claim 15, or of the 
25 antibodies of claims 16 or 17 in a screening assay for interacting drugs that inhibit, 

stimulate or effect the cell division or proliferation. 



20. 



The use of a polypeptide or of an antibody as claimed in claim 19 wherein the 
screening assay for interacting drugs comprises the following steps: 
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1 . recombinant expression of said polypeptide in a host cell 

2. isolation and optionally purification of the recombinantly expressed 
polypeptide of step 1 

3 . optionally labelling of the drugs that are tested to interact with said polypeptide 
and/or labelling of the recombinantly expressed polypeptide 

4. immobilization of the recombinantly expressed polypeptide to a solid phase 

5. binding of a potential interaction partner or a variety thereof to the polypeptide 

6. optionally one or more washing steps 

7. detection and/or quantification of the interaction, in particular by monitoring 
the amount of label remaining associated with the solid phase over background 
levels. 

21. Use of the polypeptide of claim 14, of an amino acid sequence as defined in claim 14 
or of the antibodies of claims 16 or 17 in a method for diagnosis or treatment of 
diseases associated with anomalous and/or excessive cell division or proliferation. 

22. The use of claim 20 wherein the disease is a coronary restenosis or a neoplastic disease 
selected from the group consisting of lymphoma, lung cancer, colon cancer, ovarian 
cancer and breast cancer. 

22. Use of the nucleic acid sequences as defined in claim 1 or the amino acid sequences as 
defined in claim 14 for developing computational models, structural models or other 
models for evaluating drug binding and efficacy. 
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FIG. 1 
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FIG. 2 
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FIG. 3 



WO 02/38805 



43 



PCT/EP01/13034 




FIG. 4 
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Multiple Sequence Alignment of the H38K22.2a family 
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CE61773US.ST25 
SEQUENCE LISTING 

<110> Cenix Bioscience GmbH 

<120> Eukaryotic cell division genes and their use in diagnosis and trea 
tment of proliferative diseases 

<130> CE61773US 

<150> US 60/246,750 
<151> 2000-11-09 

<160> 26 

<170> Patentln version 3.1 

<210> 1 

<211> 3104 

<212> DNA 

<213> C. elegans 

<400> 1 



atgaatcgac 


tgaagtccga 


tcaaaaaaca 


aaggtttgta 


aacggaaaca 


agacgatgaa 


60 


gtggagatga 


gtgatatgga 


aactgatcac 


aaaaagtgta 


gaaaacaaga 


aaacagtaaa 


120 


tttgtgcgtg 


tgaaaattcc 


attcgtcatc 


cattcccgtt 


tttctctttt 


tcagcattta 


180 


tctcgagcaa 


gttcgagttc 


tctagctcaa 


agcactgttc 


tttctgacat 


ttttcccaag 


240 


aactacgata 


atatcgtgag 


ttgtagcggg 


aatttcgaaa 


aaaaaactaa 


ttttgccaca 


300 


tcttgctgct 


tcgtttgtta 


tttcttgact 


agacaaattc 


tagctcatct 


agaaagctga 


360 


cttttctcaa 


aatcgttgcg 


agacccaaag 


cagaaaaatg 


tatctttttt 


aaatctacgt 


420 


ggaaacgcgc 


tccaatatta 


aatttcgagg 


ttttcccgcc 


aaatacctaa 


cgagacccaa 


480 


ctttggcgag 


cagagcgttt 


tgcccgcgat 


tttcctgcgt 


ctcttcaaac 


aatctaatca 


540 


ctgctgctgg 


tttatgaaat 


atcaattttc 


ctcatttttt 


aaagctgagc 


aatgttttcg 


600 


ctcaatccta 


aaatttttag 


tagttctaat 


tgtgatcaac 


ggtttcccat 


ttccgatcga 


660 


agtcactttt 


taaattctca 


cttttattga 


tttttttcgt 


tttgaaattc 


ctgatttctt 


720 


cctttttagt 


gataagacat 


cagttgctga 


ctgtagagaa 


agtgtgagaa 


actgttagtg 


780 


agagagagaa 


aacagtttga 


gaaaatgaaa 


aatgttttaa 


ataatgatat 


cataattatt 


840 


atttgatacc 


atttccagct 


ccggcagttc 


gtccagtgga 


ctcaggtcac 


ggaagctgtg 


900 
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tctctcaact 

%^ L* V- ■» W LA- LA L* 


tcctaacaaa 

V» V-/ V- V^ ^ V^ LA LA LA 


CE61773US.ST25 
agctaattgg aatatcgaat 


acacaataac 


tctatatttc 

W Vx k^. V-J C LA L— L» L^ ^— ' 


960 


cracaatccta 

Vj LA Vv LA LA V- O LA 


atctttttac 

LA V* Vw* L- V- w V • V> VJ V>- 


tggatcgaca 


ccacagccga 


accrt taataa 


at* pcaatcrta 

VA V^* OtAtA L* v — H w LA 


1020 


caGcaattcrc 

w LA VA v LA La L*> U- V»j w 


taacttt aac 

L* Vj Li V/ L*» ^ Imp. VJ VJ V-»* 


aactctacag 


aatgataatg 


ttctcacaat 

V* V* V-»- L- V*» LA >w LA LA V~ 


atttttaatt 

U C C C C C Ci Ci C c 


1080 


aaaatt tatTL 

LA CA Ua L-4 L* L- L- wl Vj V. 


tatatttacfa 

C L4 L- LA L- L* V- LA sA LA 


ctatagaaaa 


aatatttgat 


ttatctcraaa 

^» w LA Lx Vj LA LA LA 


ahacatttta 

Ci c Ci vu c c c c ci 


1140 


t" t* +* pa crt" 1" nrr 

i- L l l*gi y l l vj cj 


a a 1~ a a I - 1" rrna 

CT. CI LOU L L Uvj a 


aaagtgctct 


caaataattg 


1" i"1""l"1"cra PTprr 

C C C C C \~A C4. C4 C4 


cttttttaat 


1200 


i-rrl-l-ppaap'r* 

Ly L LUUQCIU L 


vj aaa l l» a 0. ci y 


ccattttcag 


ataaagcaaa 


tttttttaaa 

c c c c c c c uuu 


nf~ a1~at*papt 

CJ LQLQLCU.CL. 


1260 


aaat i-ft aat- 


t"p+*aaaaaapf 


tattgggaga 


acatgtcaca 


C/ c^ y a. c^ coa. c c. 


c Ly c Ly a a c c 


1320 


ycoyacctaLL 


pfp , apra 3 1- j- -r a 


aatttaatta 


tgtaaataaa 


a n -ha a -hi- 1-1- "h 


y Lay a Ley ay 


1380 




a"hp , apri~a'}~pr+" 
a l c ay u d uy l 


cgacccaaag 


gataaagttg 


PPl PP 3^33 PP 
yaycictciciciv^y 


Aa"hprpTpf^p , p*r < 
a a Lyy yacLL 


1440 

ITT U 


pp r'nnpk a 1* 
uaoyyaa Ltd 


a"hprrt*1-1-rfp , f~ 


cactgatctt 


ggctatgaag 


U LQU LUQ LCU 


cL>y y y l lll l 


1500 


pft"prp"r"prrpr , i- 


PTrr3 ^ rr*t" ffap 
y y aay l l Lav_* 


tgcacagaca 


caatgtgaat 




f- pra a 1* nnnf' n 
LyadLyyy Ly 


1560 


aauy ycia tya 


p^rrpf pf" +■ pa 


agcggatact 


gttcaaaatt 


1" rfa rra paapn 

tyuU QCO.QLU 


aa1"pna1"'l-pn 

CLCL LLUa L L L< CJ 


1620 


d l LaaL IC-ay 


ydcuyyddLC 


ggataaggca 


aaagtacgga 


aGaaa L, Laaa 


LaaLLyydd l 






dCLLdLLLyd 


aagtgggaga 


gcgaatttgc 


a r1~1-t M- aaa 


aapaaatfpa 
aaL-aaa L LLd 


1 740 






agttaattga 


aaaattttga 


■f- /-f ■}- 3 a 2 2 i" 3 r 

Ly LddaaLat 


arf2/iaaaaaf 

dyayddadd L 


1 ROD 


LaUaL-dLL LL 


LCOLcydyyd 


gtacacgggc 


tgcgtaaatc 


ddLdLdLdyt 


l l id Lty l uy 


X O D U 


y l. tuauctuua 


P , PTPTP'.£lPf"l~,a"t~PT 

L-y y Lay LaLvj 


ataatcaaaa 


aaaaaattta 


a"f"hpfa^a^^+" 

CX C- Ly CLCLCX CLCL L. 


■Hrr^ipa't-'t-^^tpr 
l y aaa. l l a ay 


1920 


at-frrrarrrraaa 
a - y y a y y a a a 


a1~n"t"1~a1-i-1-p 


gatctggaaa 


taatatttat 


j_ j_ i_ j_ x. 

u c u c- y Ly aaa 


a1~"t"aa1"aaa1- 
a l Laa Laaa l 


1980 


a t" a a t* t" t* 1" pa 

CI L Ci Ci C C l — L L» CI 


narTTraarrrra 

yaeLyaayy ci 


aaattttaat 


acgtttctat 


a a c a a c l, c l. 0 


pf a 1 - 1 - r a a a a a 

yd l l c ciciG aci 


2040 


tttaaattat 


uaucia L L L L L 


aaaaacaaaa 


aggttctacg 


d LL>y L C t CU c. 


a1" cf aat" af p 


2100 


ttatcaatta 

V» V* V* L-- w> LA VJ V- V— LA 


cacrttccaccr 

L* Ci y C L C* WU e 


agctctacct 


atttgccttc 


aact atcrcca 

CC UvLU C V-j C* VU 


aat ccoccac 

LA LA UL/WM^w^Mv 


2160 


ttgccgcaat 


ctggatcttg 


aaactgccat 


ctgttgctgg 


gatgttcttt 


teggacaacg 


2220 


ct LcaaL>aa L L 


d L.y du UCdd L 


ggatcgattt 


tctatgggca 


cdyydyddcy 


r*PTrtP"pfPfP , pf , t"r^ 
cyycyycy lv 


<£. U W 


tcgcctcgct 


cagaacgtgg 


gcgcttccaa 


tgcgaagcaa 


ttcaaategg 


tgtggatctc 


2340 


tcgtgacacg 


tggaatctct 


tctgggactt 


tattcttctg 


agtaagccag 


atttgtcgga 


2400 


ttacgatgat 


gaaggagcat 


ggccagtgct 


tattgatcaa 
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ttcgttgatt 


attgccgtga 


2460 
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aaatctcaat 


tatccaaagc 
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caggaaatgc gtcaaatgat 


cagcaaatgg agacaccaag 


2520 


ttattattag gacaaacaat 


tctaaaatcc 


taaaggttcg 


tgtttcccca atttcttcct 


2580 


attttcagag ttataaaata 


ttgcctggac 


gcgaaatttt 


gcttcaaaac tacggtacca 


2640 


ggtctcggca cgacaaatat 


tggttaaatg 


cgaaaatgca 


cgcgccttca atgggtactg 

Z> Zf ZJ ZJ ZJ ZJ 


2700 


tagtttcaca cttttcaaaa 


cgttaatttt 


tctatgacaa 


cagataagct ttaaaaaatc 


2760 


ttgtgaaaaa 


cttcaaaaaa 


tcaaaacrttt 

L*> V— * LA LA CA LA L* \*» \** 


a a acrn rnr a c 


atattttaac aaaaaatgtt 


2820 


tcgtgccgag accggctacc 


gtatttttta 


tgcgaaattt 


cgcgtttgtg taatattttt 


2880 


atattatacc gagaaaactc 


gacactttaa 


aggtgtggta 


gcgaattggg attttatttc 


2940 


gaaaaatatc ctaaatattc 


ccaaattcag 


aaatagcgca 


aaagaaaccc ggaatttttt 


3000 


attttaattc 


taatttacaa 


ctaatagaat 


tcaaattgtt 


tcacrtatccc atcrctcaacra 

L- V> LA \-J LA O \^ V_^- LA L- \A l_- LA CA x-j LA 


3060 


ctatcttcaa 


aataacaatt 


cacaccgccg 


gaacaaatcg 


ataa 


3104 


<210> 2 
<211> 1011 
<212> DNA 
<213> C. elegans 










<400> 2 
atgaatcgac 


tgaagtccga 


tcaaaaaaca 


aagctccggc 


agttcgtcca gtggactcag 


60 


gtcacggaag 


ctgtgtctct 


caacttcctg 


gcaaaagcta 


attggaatat cgaatacgcg 


120 


atgactctgt 


atttcgacaa 


tcctaatctt 


tttgctggat 


cgacaccaca gccgagcgtt 


180 


gataggtcca 


atatcgagcg 


cctcttcaat 


cagtatgtcg 


acccaaagga taaagttgga 


240 


gaaaaacgaa 


tgggacccca 


cggaatcaat 


cgtttgctca 


ctgatcttgg ctatgaagct 


300 


actgatcgcc 


gggttcttgt 


gctcgcctgg 


aagtttactg 


cacagacaca atgtgaattc 


360 


tcgttggatg 


aatgggtgaa 


aggaatgaca 


gctcttcaag 


cggatactgt tcaaaatttg 


420 


agacaacgaa 


tcgattcgat 


taattcagga 


ctggaatcgg 


ataaggcaaa attccacgag 


480 


ctctacctat 


ttgccttcaa 


ctatgccaaa 


tccgccgctt 


gccgcaatct ggatcttgaa 


540 


actgccatct 


gttgctggga 


tgttcttttc 


ggacaacgat 


caacaattat gactcaatgg 


600 


atcgattttc 


tatgggcaca 


ggagaacgcg 


gcggcgtctc 


gcctcgctca gaacgtgggc 


660 


gcttccaatg 


cgaagcaatt 


caaatcggtg 


tggatctctc 
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gtgacacgtg gaatctcttc 


720 
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tgggacttta ttcttctgag 


taagccagat 


ttgtcggatt 


acgatgatga 


aggagcatgg 


780 


ccagtgctta ttgatcaatt 


cgttgattat tgccgtgaaa 


atctcaatta 


tccaaagcca 


840 


ggaaatgcgt caaatgatca 




acaccaaaaa 


tagcgcaaaa 


gaaacccgga 


900 


attttttatt 


ttaattctaa 


i~ t* t" a na a c i~ a 


a t" p\ n a a 1 - 1 ca 


aattgtttca 


gtatcccatg 


960 


ctcaagacta tcttcaaaat 


aacaattcac 


accgccggaa 


caaatcgata 


a 


1011 


<210> 3 
<211> 852 
<212> DNA 
<213> C. elegans 












atgaatcgac 


tgaagtccga 


tcaaaaaaca 


aagatcgagc 


gcctcttcaa 


tcagtatgtc 


60 


gacccaaagg 


ataaagttgg 


agaaaaacga 


atgggacccc 


acggaatcaa 


tcgtttgctc 


120 


actgatcttg 


gctatgaagc 


tactgatcgc cgggttcttg 


tgctcgcctg 


gaagtttact 


180 


gcacagacac 


aatgtgaatt 


ctcgttggat 


gaatgggtga 


aaggaatgac 


agctcttcaa 


240 


gcggatactg 


ttcaaaattt 


gagacaacga 


atcgattcga 


ttaattcagg 


actggaatcg 


300 


gataaggcaa 


aattccacga 


gctctaccta 


tttgccttca 


actatgccaa 


atccgccgct 


360 


tgccgcaatc 


tggatcttga 


aactgccatc tgttgctggg 


atgttctttt 


cggacaacga 


420 


tcaacaatta 


tgactcaatg 


gatcgatttt 


ctatgggcac 


aggagaacgc 


ggcggcgtct 


480 


cgcctcgctc 


agaacgtggg 


cgcttccaat 


gcgaagcaat 


tcaaatcggt 


gtggatctct 


540 


cgtgacacgt 


ggaatctctt 


ctgggacttt 


attcttctga 


otaaoccaaa 


tttcrtcaaat 

\r* I*- 1*- \J 1— 


600 


tacgatgatg 


aaggagcatg 


gccagtgctt attgatcaat 


tcgttgatta 


ttgccgtgaa 


660 


aatctcaatt 


atccaaagcc 


aggaaatgcg 


tcaaatgatc 


agcaaatgga 


gacaccaaaa 


720 


atagcgcaaa 


agaaacccgg 


aattttttat 


tttaattcta 


atttacaact 


aatagaattc 


780 


aaattgtttc 


agtatcccat 


gctcaagact 


atcttcaaaa 


taacaattca 


caccgccgga 


840 


acaaatcgat 


aa 










852 



<210> 4 
<211> 3308 
<212> DNA 
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<213> C. elegans 
<400> 4 



atgtcgatgg 


agcctcgtaa 


gaagcggaac 


tcgattctca 


aqgtqcqqca 


ageegtcgaa 


60 


accatcgagg 


aaaccgtcat 


gaacagtggg 


cctagttcca 


caacaactaa 


tcgacgagtc 


120 


agctttcata 


acgtgaagca 


tgtcaagtca 


gttagagtca 


gtgaataatt 


tatcaataaa 


180 


ataattattt 


caggcagtat 


qacaqqqacc 


atggtaaaat 


tcttgacgcc 


acaccagtta 


240 


aggagaagat 


tactgacact 


attggatcag 


atggtatttt 


gacgtgagtt 


ccatccttta 


300 


acgtgaaata 


ataaatacat 

LA L» LA LA LA L#* LA L-J L* 


aaaaat ct tt 

LA LA LA LA LA L» L-* L* L* L* 


ttaaaaccac 

L- L- LA LA LA LA w V—* LA V> 


atfjcrccrcjaaa 


pa tern a tat t 


360 


tccgaatct c 


caacctapac 


crt cptcattt 

LA L-r L^> L- L*/ LA L* L* L* 


paacrt nt tpn 


cfpocrt ocft a a 


t pt rnafaaa 


420 


actatocrata 

LA L*» L* LA VA v_A L* V-A 


t" Cf t" p 1" p t" raa 


aaraarfatr 

CA U UUCi V — U- Ci C v*> 


aauyay a, a ci 


a^yciCLCiV^yy^ 


rf^nattrrttt 


480 


craaaccacaa 

^ t-4 *■* LA V-» \^ LA LA 


crana frraar 


a pt a 1 1 at a c 


y uciciQy a i_. y 


t pnsa^ppap 


^.RPr^^^P^tt 

ci a. \ — ci ct a. ci y l, u. 


540 


acccraaccraa 


ttattaatat 


rrppapt crcrat 
y v— » v.* ^ y y ^ *^ 


a at a crt ^ an 

y o lulvl Lay 


raaf rrtt paa 

v-> uu uy C 1— v CJ. CI 


f araarnaai" 

i— ci v^. ci ci v_y y ci ci l. 


600 


raaaaaaata 

V^c HUVJU CI y u L, CL 


ay y a lo. l y l. 


^rrttrr^tprrt 


t paatt rttt 


tp^prrsttpp 


uaaay u u^^y 


\J VJ \J 


aaacataaca 

LA LA LA V—* LA L* LA LA L-* ^ 


r 1" a p a a t cr a a 


t acraapt ai~ a 


p p a a t aria p p 


t pna t n3 at r 

u v^y o l y q a l. 


ddaciyi^ciy o y 


720 


ggcggccagt 


acaataaaac 

LA L> LA LA L- LA LA LA LA 


aatatcrttcra 

LA VA L- LA L- LA L** L^ VA LA 


attaataaaa 

LA L*» L> LA LA L~ LA VA LA LA 


aaaaccaaat 

LJ LA LA LA L^ LA LA LA L~ 


tatcttaatt 

L-p LA L^ L^. L-- LA LA L^ L* 


780 


ttacagatga 


atgtgttcaa 


tttcacaaac 


ttaaaaacca 

L#* L* ^A LA LA VA LA 


ctgaaatgga 


tacaaataaa 

L* LA L*** LA LA LA L» LA LA LA 


840 


ttagatgaaa 


ataataccat 


gaatgetate 


eggattcega 


ttaattcaaa 


cgtcatgcct 


900 


gtagacatgg 


acatcactga 


acatcacact 


ttaattgaag 


aaaagaaaaa 


tgatacattc 


960 


qqqccaaqtc 


aactgatgga 


cattteggeg 


ccacaagttc 


aagttaatga 


tactttggee 


1020 


attttcaaca 


gtccgagaga 


catctgtaat 


aagggtttgg 


gtgttcctca 


gaatct aata 


1080 


aatatcgcct 


cgaacgtcgt 


acctgtggac 


atggacatca 


ctgatcaggc 


cgtattaaac 


1140 


gcggagaaga 


aaaatgatca 


attcgagaca 


agtcagctta 


tggacatttc 


tattccgaaa 


1200 


gttctagtaa 


atgacactat 


ggcgatgttc 


aacagcccga 


aacaegtcag 


taagagcagc 


1260 


atggatctcg 


agaaaacgat 


tgaagecget 


gacaaatcaa 


cgaaataccc 


gagtatcgea 


1320 


gatgaggtgg 


aagatttaga 


catggatatg 


gatatcactg 


aacaacaacc 


atgtgaggct 


1380 


ggtaatcagc 


agaacgacgg 


ettgeaaett 


caaaaggagg 


atttaatgga 


cattteggtg 


1440 


attcgagatt 


cacctgcagt 


aaacgacacc 


atggctgtgt 
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tccagagtcc 


tgccagagta 


1500 
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aagatcggag 


cggtaagttt 


taagcacact 


ttccaataaa 


aatgtatttc 


tttcagaaca 


1560 


actcgatcat 


tgattcgcag 


aaatctatcg 


tgttcggtga 


cgaaatgagc 


attgacgaga 


1620 


cacaaaatga 


tggaaccttg 


acgttgccaa 


agtcgaatgt 


agaagtgact 


acaactaatg 


1680 


atgtctacac 


qtctctcqaq 


eggcaagagg 


aaaatgette 


agaaaaegta 


tccatgataa 


1740 


acgaatcttc 


tgttcattcg 


gaaatcgaca 


aaaagtcgtt 


tatgetcate 


gaagaagaaa 


1800 


aacrcttttat 

\A v_-* U U U LA L* 


gcactcctcc 


atgattgatg 


tagcacaaaa 


qttqqaaqac 


gatggttcgt 


1860 


cgaagacgcc 


agtcatcctt 


gcttcacagt 


cagcttctct 


tgccactaaa 


gaaccatcag 


1920 


cccttcacaa 


etcgagtgea 


actctcaaca 


attcgatgga 


attggacaac 


aatactcttc 


1980 


ttaaaactat 

W ^ LA LA ^4 I* 


gcaaattaca 


acgtgtgaag 


acattagcat 


qqtccatqaq 


tetattgetg 


2040 


ttaaactcraa 

I— » \-> \^ LA LA V— ' \~ LA LA 


cagtaacaaa 


gagcaggagc 


aatteggaga 


tgagactttg 


cagaaaaatg 


2100 


crtaaatttccf 


tttattcaat 


aactctatta 


aaagtatgtt 


ttagatacct 


cgaatactgg 


2160 


cacaaatttc 

V-» L^ \-4 LA LA L^ L- * 


acattccaag 


gecataatga 


aacategcaa 


atcatgaaca 


atgtcgactc 


2220 


an a a cr c a erf" cr 


aacacgt cca 


agatttcaac 


atattegget 


ttcaatttga 


gcatcaacca 


2280 


Qtctatct ct 


aaacgacgtc 


gatctcttct 


gaattctget 


cgtgaatctc 


ctcgtcgtgt 


2340 


tgcgttggag 


aattctataa 


tgtcgatgaa 


tgggcaaaca 


atggaagctc 


tgacagaata 


2400 


tcgacagaat 


aaaactatgc 


agacgagtca 


agattcgatg 


ccgagtatga 


gtttgaacga 


2460 


ttcqaaaaaa 


gatattctcg 


cgatggtaag 


aatatctctt 


tgagtattga 


atcgaaaatg 


2520 


tctttcagaa 


tacatcagtc 


cgctctcctc 


atctgaattc 


ttcaaaaact 


gctgccccag 


2580 


gaacaccatc 


attgatgtca 


caaaatgtac 


aacttccacc 


tccatctcct 


caattcgaaa 


2640 


tgecagaett 


cgatccagct 


gtqqtcaacg 

3 3 3 3 


ttgtatattt 


aacatctgaa 


gatccgtcca 


2700 


ctgaacaaca 


tccagaagct 


ctcaaatttc 


agegtattgt 


tgaaaacgag 


aaaatgaaag 


2760 


tacaacacga 


gattgattct 


ctgaattcaa 


ccaatcaact 


ttctgetgag 


aaaattgata 


2820 


tgttgaagac 


taaggagctc 


ttgaagttta 


gtcatgatga 


gcgagaagcg 


attatgattg 


2880 


caagaaaaga 


cgeggaaate 


aagtttttgg 


agettegtet 


gaaatttgea 


ctcgagaaaa 


2940 


aaattgaaag 


tgaccaggaa 


attgetgaac 


tagaacaagg 


aaattcgaaa 


atggctgagc 


3000 


agctaagagg 


tctcgataag 


atggctgtcg 


ttcaaaaaga 
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actagaaaag 


ctgagaagtc 


3060 
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ttcctccatc 


acgcgaagag 


agcgggaaaa 


tccgaaagga 


gtggatggag 


atgaagcaat 


3120 


gggaattcga ccagaaaatg 


aaagcactcc 


gaaatgtacg 


ctcaaacatg 


attgcacttc 3180 


gttcagagaa aaatgctctc 


gaaatgaaag 


tcqcqqaaqa 


acacgagaag 


tttgcccaga 


3240 


ggaacgattt 


gaagaaaagt 


cgaatgctgg 


tgttctctaa 


ggctgttaag 


aaaattgtga 


3300 


acttctag 












3308 


<210> 5 
<211> 3033 
<212> DNA 
<213> C. elegans 












<400> 5 
atgtcgatgg 


agcctcgtaa 


rra p rrpnrra a c 


t" r n a t" t" p ir a 


a ant" acnacri 

la yy v — ■ y \u 


agccgtcgaa 


60 


accatcgagg 


aaaccgtcat 


rra aCrlClf' Cinn 
y aaL/Oy j j j 


crt* ao1~ i~ rca 

\— « \— ' I — CJ. \_j 1 — l — w Vw* LA 


caacaactaa 


tcgacgagtc 


120 


agctttcata 


acgtgaagca 


trrl* CriaacAa 

1 — V-J I— . CI CA ' t-A M 


t* a t* a a c a on a 

LA yj LA Vw, CJ. y y^ va 


acca tcrcrtaa 

LA V-^ LA L* LA LJ L- LA LA 


aattcttgac 


180 


gccacaccag 


l caaggagaa 


rr^ 1" 1" 3 ct na c 


a nt~ a t" 1" ncrat 

LA L. LA L» 1— yj ^ LA L» 


cacratootat 

LA LJ LA L-i NA XA L^ LA L^ 


tttgacacca 


240 


cgtggcggaa 


acatggatat 


t* t* ccaaatct 

V— ' \A U U V— l— 


ccaacct crca 

w ' ^ V_> V-^ U> y V-» LA 


catcctcatt 

\J L» ' L^ LA L* 


tcaagtgttc 


300 


ggcggtggta 


atctcgataa 


aactataaat 

LA LA N^- LA L* %J y LA L- 


atcrt ctctco 


aaacaactat 


caacgagaac 


360 


aacgaaacgg 


cgagattgtt 


L M Q □ CA • wtA l_«CA 


ananai - rraa 

ca y^ la y4 la v- v-- v^la la 


cactattata 


cgaaaagatc 


420 


gtcgaaacca 


caacaaaagt 


taccaaocca 

L* LA LA LA \_j LA 


attattaata 


tgccactgga 


tgatacctta 


480 


gcaatgttca 


atacaacgaa 


tcaaaaacrat 

U» LA LA ^ LA M ^-A 


aaaaatatcrt 


caatt oat cq 

\*t LA LJ L~ L«> VA LA L* L- * L^ 


ttcagttctt 


540 


ttcacgattc 


ccaaagttcc 


aaaacataac 

LA LA \^ V— ' LA V- <— 1 LA 


actacaatoa 

y V_* L- LA LA LA I— ^ tA 


ataaaactat 

LA L»# LA \-4 LA LA L«* LA L- 


acccratcrcrac 

C4. t yj la <— yj yj c_-i v — 


600 


ctcgatgaat 


caaaagcagc 


aaaccfCfccacr 

yyy v^y ^ v-* v«< la vj 


taccratcraaa 

L» W W LA L* ^ LA V—A LA 


caataaatat 

Lp^ >A LA L^ \J LA LA L* LJ I— » 


gttcaatttc 


660 


acaaacttgg 


aagccgctga 


aatggatacg 


agtaaattag 


atgaaaataa 


taccatgaat 


720 


gctatccgga 


ttccgattaa 


ttcaaacgtc 


atgcctgtag 


acatggacat 


cactgaacat 


780 


cacactttaa 


ttgaagaaaa 


gaaaaatgat 


acattcgggc 


caagtcaact 


gatggacatt 


840 


tcggcgccac 


aagttcaagt 


taatgatact 


ttggccattt 


tcaacagtcc 


gagagacatc 


900 


tgtaataagg 


gtttgggtgt 


tcctcagaat 


ctaataaata 


tcgcctcgaa 


cgtcgtacct 


960 


gtggacatgg 


acatcactga 


tcaggccgta 


ttaaacgcgg 


agaagaaaaa 


tgatcaattc 


1020 
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Glu Lys Arg Met Gly Pro His Gly lie Asn Arg Leu Leu Thr Asp Leu 
85 90 95 



Gly Tyr Glu Ala Thr Asp Arg Arg Val Leu Val Leu Ala Trp Lys Phe 
100 105 110 



Thr Ala Gin Thr Gin Cys Glu Phe Ser Leu Asp Glu Trp Val Lys Gly 
115 " 120 125 



Met Thr Ala Leu Gin Ala Asp Thr Val Gin Asn Leu Arg Gin Arg lie 
130 135 140 



Asp Ser lie Asn Ser Gly Leu Glu Ser Asp Lys Ala Lys Phe His Glu 
145 150 155 160 



Leu Tyr Leu Phe Ala Phe Asn Tyr Ala Lys Ser Ala Ala Cys Arg Asn 
165 170 175 



Leu Asp Leu Glu Thr Ala lie Cys Cys Trp Asp Val Leu Phe Gly Gin 
180 185 " 190 



Arg Ser Thr He Met Thr Gin Trp He Asp Phe Leu Trp Ala Gin Glu 
195 200 205 



Asn Ala Ala Ala Ser Arg Leu Ala Gin Asn Val Gly Ala Ser Asn Ala 
210 215 220 



Lys Gin Phe Lys Ser Val Trp He Ser Arg Asp Thr Trp Asn Leu Phe 
225 230 235 * 240 



Trp Asp Phe He Leu Leu Ser Lys Pro Asp Leu Ser Asp Tyr Asp Asp 
245 250 1 "* 255 



Glu Gly Ala Trp Pro Val Leu He Asp Gin Phe Val Asp Tyr Cys Arg 
260 265 270 



Glu Asn Leu Asn Tyr Pro Lys Pro Gly Asn Ala Ser Asn Asp Gin Gin 
275 280 285 
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Met Glu Thr Pro Lys lie Ala Gin Lys Lys Pro Gly lie Phe Tyr Phe 
290 1 295 ' 300 



Asn Ser Asn Leu Gin L*eu lie Glu Phe Lys Leu Phe Gin Tyr Pro Met 
305 310 315 ^ 320 



Leu Lys Thr He Phe Lys He Thr He His Thr Ala Gly Thr Asn Arg 
325 330 335 



<210> 9 

<211> 283 

<212> PRT 

<213> C. elegans 

<400> 9 

Met Asn Arg Leu Lys Ser Asp Gin Lys Thr Lys He Glu Arg Leu Phe 
1 5 10 15 



Asn Gin Tyr Val Asp Pro Lys Asp Lys Val Gly Glu Lys Arg Met Gly 
20 25 30 



Pro His Gly He Asn Arg Leu Leu Thr Asp Leu Gly Tyr Glu Ala Thr 
35 40 45 



Asp Arg Arg Val Leu Val Leu Ala Trp Lys Phe Thr Ala Gin Thr Gin 
50 55 " 60 



Cys Glu Phe Ser Leu Asp Glu Trp Val Lys Gly Met Thr Ala Leu Gin 
65 70 75 80 



Ala Asp Thr Val Gin Asn Leu Arg Gin Arg He Asp Ser lie Asn Ser 
85 90 95 



Gly Leu Glu Ser Asp Lys Ala Lys Phe His Glu Leu Tyr Leu Phe Ala 
100 * 105 " 110 



Phe Asn Tyr Ala Lys Ser Ala Ala Cys Arg Asn Leu Asp Leu Glu Thr 
115 120 125 
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Ala He Cys Cys Trp Asp Val Leu Phe Gly Gin Arg Ser Thr He Met 
130 135 140 



Thr Gin Trp He Asp Phe Leu Trp Ala Gin Glu Asn Ala Ala Ala Ser 
145 150 155 160 



Arg Leu Ala Gin Asn Val Gly Ala Ser Asn Ala Lys Gin Phe Lys Ser 
165 170 175 



Val Trp He Ser Arg Asp Thr Trp Asn Leu Phe Trp Asp Phe He Leu 
180 ^ ' 185 190 



Leu Ser Lys Pro Asp Leu Ser Asp Tyr Asp Asp Glu Gly Ala Trp Pro 
195 200 205 



Val Leu He Asp Gin Phe Val Asp Tyr Cys Arg Glu Asn Leu Asn Tyr 
210 215 ' 220 



Pro Lys Pro Gly Asn Ala Ser Asn Asp Gin Gin Met Glu Thr Pro Lys 
225 230 235 240 



He Ala Gin Lys Lys Pro Gly He Phe Tyr Phe Asn Ser Asn Leu Gin 
245 250 255 



Leu He Glu Phe Lys Leu Phe Gin Tyr Pro Met Leu Lys Thr He Phe 
260 265 270 



Lys He Thr lie His Thr Ala Gly Thr Asn Arg 
275 280 



<210> 


10 


<211> 


1010 


<212> 


PRT 


<213> 


C. e 


<400> 


10 



Met Ser Met Glu Pro Arg Lys Lys Arg Asn Ser He Leu Lys Val Arg 
15 10 15 
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Gin Ala Val Glu Thr He Glu Glu Thr Val Met Asn Ser Gly Pro Ser 
20 25 30 



Ser Thr Thr Thr Asn Arg Arg Val Ser Phe His Asn Val Lys His Val 
35 ^ 40 45 



Lys Gin Tyr Asp Arg Asp His Gly Lys He Leu Asp Ala Thr Pro Val 
50 1 ~ 55 "* 60 



Lys Glu Lys He Thr Asp Thr He Gly Ser Asp Gly He Leu Thr Pro 
65 70 75 80 



Arg Gly Gly Asn Met Asp He Ser Glu Ser Pro Ala Cys Thr Ser Ser 
85 90 95 



Phe Gin Val Phe Gly Gly Gly Asn Leu Asp Lys Thr Met Asp Met Ser 
100 105 110 



Leu Glu Thr Thr He Asn Glu Asn Asn Glu Thr Ala Arg Leu Phe Glu 
115 120 125 



Thr Thr Arg Asp Pro Thr Leu Leu Tyr Glu Lys He Val Glu Thr Thr 
130 " 135 140 



Thr Lys Val Thr Glu Arg He Val Ser Met Pro Leu Asp Asp Thr Leu 
145 150 155 160 



Ala Met Phe Asn Thr Thr Asn Gin Glu Asp Lys Asp Met Ser Val Asp 
165 170 175 



Arg Ser Val Leu Phe Thr He Pro Lys Val Pro Lys His Asn Ala Thr 
180 185 190 



Met Asn Arg Thr He Pro Met Asp Leu Asp Glu Ser Lys Ala Ala Gly 
195 200 205 



Gly Gin Cys Asp Glu Thr Met Asn Val Phe Asn Phe Thr Asn Leu Glu 
210 215 ■ 220 
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Ala Ala Glu Met Asp Thr Ser Lys Leu Asp Glu Asn Asn Thr Met Asn 
225 230 235 240 



Ala lie Arg lie Pro lie Asn Ser Asn Val Met Pro Val Asp Met Asp 
245 250 255 



He Thr Glu His His Thr Leu He Glu Glu Lys Lys Asn Asp Thr Phe 
260 265 270 



Gly Pro Ser Gin Leu Met Asp He Ser Ala Pro Gin Val Gin Val Asn 
275 280 285 



Asp Thr Leu Ala He Phe Asn Ser Pro Arg Asp He Cys Asn Lys Gly 
290 295 300 



Leu Gly Val Pro Gin Asn Leu He Asn He Ala Ser Asn Val Val Pro 
305 310 315 320 



Val Asp Met Asp He Thr Asp Gin Ala Val Leu Asn Ala Glu Lys Lys 
325 .330 335 



Asn Asp Gin Phe Glu Thr Ser Gin Leu Met Asp He Ser He Pro Lys 
340 345 350 



Val Leu Val Asn Asp Thr Met Ala Met Phe Asn Ser Pro Lys His Val 
355 360 365 



Ser Lys Ser Ser Met Asp Leu Glu Lys Thr He Glu Ala Ala Asp Lys 
370 375 380 



Ser Thr Lys Tyr Pro Ser He Ala Asp Glu Val Glu Asp Leu Asp Met 
385 390 395 400 



Asp Met Asp He Thr Glu Gin Gin Pro Cys Glu Ala Gly Asn Gin Gin 
405 410 415 



Asn Asp Gly Leu Gin Leu Gin Lys Glu Asp Leu Met Asp He Ser Val 
420 425 430 



Seite 21 



WO 02/38805 



PCT/EP01/13034 



CE61773US.ST25 

He Arg Asp Ser Pro Ala Val Asn Asp Thr Met Ala Val Phe Gin Ser 
435 440 445 



Pro Ala Arg Val Lys He Gly Ala Asn Asn Ser He He Asp Ser Gin 
450 455 460 



Lys Ser He Val Phe Gly Asp Glu Met Ser He Asp Glu Thr Gin Asn 
465 470 475 480 



Asp Gly Thr Leu Thr Leu Pro Lys Ser Asn Val Glu Val Thr Thr Thr 
485 490 495 



Asn Asp Val Tyr Thr Ser Leu Glu Arg Gin Glu Glu Asn Ala Ser Glu 
500 505 510 



Asn Val Ser Met He Asn Glu Ser Ser Val His Ser Glu He Asp Lys 
515 520 525 



Lys Ser Phe Met Leu He Glu Glu Glu Arg Ala Phe Met His Ser Ser 
530 535 540 



Met He Asp Val Ala Gin Lys Leu Glu Asp Asp Gly Ser Ser Lys Thr 
545 " 550 555 560 



Pro Val He Leu Ala Ser Gin Ser Ala Ser Leu Ala Thr Lys Glu Pro 
565 570 575 



Ser Ala Leu His Asn Ser Ser Ala Thr Leu Asn Asn Ser Met Glu Leu 
580 585 590 



Asp Asn Asn Thr Leu Leu Lys Thr Met Gin He Thr Thr Cys Glu Asp 
595 600 605 



He Ser Met Val His Glu Ser He Ala Val Glu Leu Asn Ser Asn Lys 
610 615 620 



Glu Gin Glu Gin Phe Gly Asp Glu Thr Leu Gin Lys Asn Asp Thr Ser 
625 630 635 640 
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Asn Thr Gly Ala Asn Phe Thr Phe Gin Gly His Asn Glu Thr Ser Gin 
645 650 655 



He Met Asn Asn Val Asp Ser Glu Ala Val Asn Thr Ser Lys He Ser 
660 665 670 



Thr Tyr Ser Ala Phe Asn Leu Ser He Asn Gin Ser He Ser Lys Arg 
675 680 685 



Arg Arg Ser Leu Leu Asn Ser Ala Arg Glu Ser Pro Arg Arg Val Ala 
690 695 700 



Leu Glu Asn Ser He Met Ser Met Asn Gly Gin Thr Met Glu Ala Leu 
705 710 715 720 



Thr Glu Tyr Arg Gin Asn Lys Thr Met Gin Thr Ser Gin Asp Ser Met 
725 730 735 



Pro Ser Met Ser Leu Asn Asp Ser Gly Arg Asp He Leu Ala Met Asn 
740 745 750 



Thr Ser Val Arg Ser Pro His Leu Asn Ser Ser Lys Thr Ala Ala Pro 
755 760 765 



Gly Thr Pro Ser Leu Met Ser Gin Asn Val Gin Leu Pro Pro Pro Ser 
770 775 780 



Pro Gin Phe Glu Met Pro Asp Phe Asp Pro Ala Val Val Asn Val Val 
785 790 " . 795 800 



Tyr Leu Thr Ser Glu Asp Pro Ser Thr Glu Gin His Pro Glu Ala Leu 
805 810 815 



Lys Phe Gin Arg He Val Glu Asn Glu Lys Met Lys Val Gin His Glu 
820 825 830 



He Asp Ser Leu Asn Ser Thr Asn Gin Leu Ser Ala Glu Lys He Asp 
835 840 845 
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Met Leu Lys Thr Lys Glu Leu Leu Lys Phe Ser His Asp Glu Arg Glu 
850 ~ 855 860 



Ala He Met He Ala Arg Lys Asp Ala Glu He Lys Phe Leu Glu Leu 
865 870 875 880 



Arg Leu Lys Phe Ala Leu Glu Lys Lys He Glu Ser Asp Gin Glu He 
885 890 895 



Ala Glu Leu Glu Gin Gly Asn Ser Lys Met Ala Glu Gin Leu Arg Gly 
900 905 910 



Leu Asp Lys Met Ala Val Val Gin Lys Glu Leu Glu Lys Leu Arg Ser 
915 920 925 



Leu Pro Pro Ser Arg Glu Glu Ser Gly Lys He Arg Lys Glu Trp Met 
930 935 940 



Glu Met Lys Gin Trp Glu Phe Asp Gin Lys Met Lys Ala Leu Arg Asn 
945 " 950 955 960 



Val Arg Ser Asn Met He Ala Leu Arg Ser Glu Lys Asn Ala Leu Glu 
- 965 970 975 



Met Lys Val Ala Glu Glu His Glu Lys Phe Ala Gin Arg Asn Asp Leu 
980 985 990 



Lys Lys Ser Arg Met Leu Val Phe Ser Lys Ala Val Lys Lys He Val 
995 * 1000 1005 



Asn Phe 
1010 



<210> 11 

<211> 1207 

<212> PRT 

<213> C. elegans 

<400> 11 

Met Ser Thr He Thr Ser Gin Lys Gly He Arg Leu Leu Thr Glu Arg 
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10 15 



Arg Gly Asp Asn Ser Leu lie Leu Thr Leu Thr Leu His Ser Leu Cys 
20 25 30 



Ser Ser Pro His Leu Ser Ser Phe Phe Asp He Gly Cys Gly Phe Leu 
35 40 45 



Ser Pro Asn Asn Lys Asn Ala Met Asn Thr Ser His Asn Ser Phe Phe 
50 55 60 



Phe Phe Leu Leu Leu Phe Leu Phe Ser Phe Phe Leu Pro Phe Ala He 
65 70 75 80 



Gin Leu Phe Gly Lys Leu Pro Asn Ser Lys Lys Met Trp Ala Phe Pro 
85 90 95 



Ala He Leu Ser He Asn Val Asn Leu He Ser Arg Lys Leu Met Val 
100 105 110 



Thr Val He Pro Lys He He Ser Ser Pro Tyr Pro Arg Thr Arg Leu 
115 ' 120 ^ 125 



Pro Leu Tyr Leu Tyr Thr Val Ser He He He Ser Cys Ser Leu Leu 
130 135 140 



Tyr Trp Asn Leu Leu Tyr Cys Lys Asn Tyr Asp Cys Val Val Glu Lys 
145 150 - 155 160 



Glu Phe Arg Trp Gly Ser Thr Arg His Leu Leu Gin Tyr Phe Pro Val 
165 170 175 



He Ala Ala Pro He He Met Val He Ser Phe Ser Trp Leu He He 
180 185 190 



Ala He Tyr Tyr Ser Ser Ser Ser Cys Val Leu Thr Phe Asn Phe Met 
195 200 205 



Glu Met Pro Ser Ala Val Leu Cys Ser Leu Leu Gly Gly He Ser Ser 
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210 215 220 



Val He Glu He His Phe Ser He Glu Val Asn Gin Val Gin Trp Thr 
225 230 235 240 



Asp Gin Trp Leu Leu Ser Ser Val Gly Leu Pro He Asn Asp Cys Leu 
245 250 255 



Lys He Asp lie Phe Arg Asp Leu Gin Tyr Phe Tyr Ala Phe Tyr Met 
260 265 270 



Leu Gin Leu Arg Ser His Phe Asn Asn Pro Ser Asn He Phe Glu Phe 
275 280 285 



Pro He Phe Phe Lys Ser Met Asn Gin Lys Tyr Tyr Val Asn Cys Asp 
290 ~ 295 300 



He Tyr Ser Cys Ser He His Phe Met Lys Lys Gin Lys Lys Met Ser 
305 J 310 315 320 



Phe Ser Gin Ala Gin Asp Val Tyr Leu Arg Leu Lys Gin Glu Lys Glu 
325 330 335 



Glu Glu Lys Gin Arg Glu Arg Ala Glu Arg Glu Lys Arg Asn Glu Thr 
340 345 350 



He Ala Ala Thr Asn Lys Ser Arg Lys Lys Met Asn Gin Ala Leu Ala 
355 360 365 



Lys Arg Asn Lys Lys Gly Gin Pro Asn Leu Asn Ala Gin Met Asp Met 
370 ^ 375 380 



Ala Ser Asp Glu Asn He Gly Ala Asp Gly Glu Gin Lys Pro Ser Arg 
385 ~ 390 395 400 



Pro Phe Leu Arg Lys Gly Gin Gly Thr 'Ala Arg Phe Arg Met Val Val 
405 410 415 



Cys Ala Asn Thr Arg Leu lie Glu He lie Tyr Glu Val Gin Pro Arg 
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420 425 430 



Asn Asn Lys Thr Ser Ala Gly Ala Pro Pro Thr Ser Glu Leu Ser Ser 
435 440 445 



Ala Ser Ser Pro Ser lie Asn Val Pro Arg Phe Ser Leu Ser Asn Ala 
450 455 460 



Leu Pro Asn Ser Ala Arg Thr Val Asp Ser Gly lie Ser Asn Glu Asp 
465 470 475 480 



Glu Thr Arg Pro Pro Thr Thr Ala Ser Leu Pro Met Asp Gin Pro Ser 
485 490 495 



Leu Ser Ser Ser Pro Glu Asn Arg Leu Asn Pro Ala Pro Ser Val Ala 
500 505 510 



Glu Glu His Gly His Ser Gly Gin His Ala Glu Glu Glu Glu Asp Asn 
515 " 520 525 



Asp Thr Asp Glu Val Ser Ala Met Pro Ser Phe Val Pro Asp Glu Pro 
530 ' 535 540 



Ser Thr Leu Val Asn Ser Asp His Glu Leu Ser Asp Asp Ala Leu Lys 
545 550 555 560 



Tyr Lys Asn Ala Ala Ala Glu Phe Lys Ala Phe Glu Arg Arg Met Asp 
565 570 575 



Ser Met Arg Ser Ala Ser Thr lie Thr Thr Ser Leu Ala Thr Pro Ser 
580 585 590 



Ser Cys Ala Pro Ser Asn Ser Ser Glu Pro Pro Thr Arg Ser Thr Pro 
595 600 605 



lie Met Asn Asp Leu Gly Val Gly Pro Asn Asn His Asn Trp Pro Ser 
610 615 620 



Ser Met Gin Glu Leu Ser Gly lie Ser Leu Glu Thr Pro Gin Ala Arg 
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625 630 635 640 



Pro Leu Gly Ser Asn Arg lie Asn Gin Leu Val Arg Ser Glu Ala Gin 
645 650 655 



Thr Gly He Ser Leu Leu Gin His His Glu Arg Pro Thr Val Thr Ala 
660 665 670 



Pro Leu Arg Arg Asn Asp Met Met Asn Ser Ser Arg Gin Asn Pro Gin 
675 ' 680 685 



Asn Gly Asn Val Gin Asp Glu Asn Arg Pro Glu His Val Tyr Asp Gin 
690 695 700 



Pro He His Val Pro Gly Ser Ser Leu Asp Arg Gin Lys Leu Glu He 
705 710 715 720 



Glu He Arg Arg His Arg Asn Leu Asn He Gin Leu Arg Asp Thr He 
725 730 735 



Ala His Leu Asp Tyr Ala Glu Glu Ser Val His Thr Thr Lys Arg Gin 
740 745 750 



Leu Glu Glu Lys He Ser Glu Val Asn Asn Phe Lys Lys Glu Leu He 
755 760 765 



Glu Glu Phe Lys Lys Cys Lys Lys Gly Val Glu Glu Glu Phe Glu Lys 
770 775 ^ 780 



Lys Phe Glu Lys He Lys Glu Asp Tyr Asp Glu Leu Tyr Glu Lys Leu 
785 790 795 800 



Lys Arg Asp Gin Arg Asp Leu Glu Arg Asp Gin Lys He Leu Lys Lys 
805 ~ 810 ' 815 



Gly Thr Gly Glu Arg Asn Lys Glu Phe Thr Glu Thr He Ala Thr Leu 
820 825 830 



Arg Asp Lys Leu Arg Ala Ser Glu Thr Lys Asn Ala Gin Tyr Arg Gin 
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835 840 845 



Asp He Arg Val Arg Asp Glu Lys Leu Lys Lys Lys Asp Glu Glu He 
850 " " ^ 855 860 



Glu Lys Leu Gin Lys Asp Gly Asn Arg Leu Lys Ser Thr Leu Gin Thr 
865 * 870 875 880 



Leu Glu Lys Arg Val Lys Gin Leu Arg Thr Glu Lys Glu Arg Asp Asp 
885 890 895 



Lys Glu Lys Glu Met Phe Ala Lys Val Ala Met Asn Arg Lys Thr Ser 
900 905 910 



Asn Pro Val Pro Pro Val Leu Asn Gin Ser Val Pro He Ser He Thr 
915 920 925 



Ser Asn Gly Pro Ser Arg His Pro Ser Ser Ser Ser Leu Thr Thr Phe 
930 " " 935 940 



Arg Lys Pro Ser Thr Ser Asn Arg Glu Arg Gly Val Ser Trp Ala Asp 
945 ' 950 955 960 



Glu Pro Asn Glu Gin Ser Leu Glu Ala Val Pro Gin Glu Phe Leu Met 
965 970 975 



Met Pro Val Lys Glu Met Pro Gly Lys Phe Gly Lys Cys Thr lie Tyr 
980 985 990 



Arg Asp Ser Leu Gly Glu Thr Ser Lys Val Thr Asp Thr He Ala Asn 
995 1000 1005 



Gly Leu Leu Phe Glu Tyr Ser Asn Gly Asp Leu Arg Trp Val Asn 
1010 1015 * 1020 



Arg Gin Asn Ala Val Asn He Tyr He Ser Ala Val Asp Lys Thr 
1025 1030 1035 



Val Arg He Asp Leu Pro Thr Tyr Asn He Ser He He His Thr 
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1040 1045 1050 



Phe Gin Arg Gin Val Glu Val Leu Arg Pro Gly Asn Asn He Thr 
1055 1060 1065 



Leu He Ser He Lys Arg Arg Glu Val Arg Thr Asp Leu He Tyr 
1070 1075 1080 



Gin Asn Gly Met Tyr Lys Thr Glu He Phe Asn Arg Asp Gly Arg 
1085 J ~ 1090 1095 



Tyr Val Thr Lys Asp Phe Ser Asn Gin Glu Val Ser Arg Lys Tyr 
1100 1 1105 1110 



Asn Pro Gly Thr His Thr Tyr Arg Asp Asn Gin Cys Arg Tyr Val 
1115 1120 1125 



Leu Val Thr Asp Tyr Asn Asp Phe Glu Leu Val Glu Pro Glu Phe 
1130 1135 1140 



Arg Leu Arg Trp Tyr Gin Gly Asp Pro Thr Gly Leu Asn Asn Gin 
1145 " 1150 1155 



Tyr He Leu Lys lie lie Gly Arg Pro Glu Cys Ser Glu Lys Thr 
1160 1165 1170 



Leu Arg Leu Glu Val Asn Leu Ser Thr Cys Glu Gly Thr Leu Glu 
1175 1180 1185 



Thr Ala Glu Met lie Gly Asp Lys Arg Arg Lys Thr Thr Leu Phe 
1190 1195 1200 



Gin Trp Lys Lys 
1205 



<210> 12 

<211> 780 

<212> DNA 

<213> homo sapiens 
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<400> 12 



atgaacaagt 


tgaaatcatc 


qcaqaaqqat 


aaagttcgtc 


agtttatgat 


cttcacacaa 


60 


tctagtgaaa 


aaacagcagt 


aagttgtctt 


tctcaaaatg 


actggaagtt 


aqatqttqca 


120 


acagataatt 


ttttccaaaa 


tcctgaactt 


tatatacgag 


agagtgtaaa 


aggatcattg 


180 




aattacraaca 


actotacaat 


agatacaaag 


accctcaaga 


tgagaataaa 


24C 


attcrcraatacr 


atcrcrcataca 


gcagttctgt 


aataacctqq 


cactcgatcc 


agccagcatt 


30C 


ant nt~ fitters 


1 1 attcrcatcr 


craaattcaaa 


gcagcaacac 


agtgcgagtt 


ctccaaacag 


36C 


riant tp at rrn - 


?i t cr cr c a t o a c 


agaattagga 


tgtgacagca 


tagaacaact 


aaaggcccag 


42C 


atacccaaga 


tggaacaaga 


attgaaagaa 


ccaggacgat 


ttaaggattt 


ttaccagttt 


48C 


acttttaatt 


ttgcaaagaa 


tccaggacaa 


aaaggattag 


atctagaaat 


ggccattgcc 


54C 


tactggaact 


tagtgcttaa 


tggaagattt 


aaattcttag 


acttatggaa 


taaatttttg 


60C 


ttggaacatc 


ataaacgatc 


aataccaaaa 


gacacttgga 


atcttctttt 


agacttcagt 


66C 


acgatgattg 


cagatgacat 


gtctaattat 


gatgaagaag 


gagcatggcc 


tgttcttatt 


72C 


gatgactttg 


tggaatttgc 


acgccctcaa 


attgctggga 


caaaaagtac 


aacagtgtag 


78C 



<210> 13 

<211> 259 

<212> PRT 

<213> homo sapiens 

<400> 13 

Met Asn Lys Leu Lys Ser Ser Gin Lys Asp Lys Val Arg Gin Phe Met 
15 10 15 

lie Phe Thr Gin Ser Ser Glu Lys Thr Ala Val Ser Cys Leu Ser Gin 
20 " 25 30 

Asn Asp Trp Lys Leu Asp Val Ala Thr Asp Asn Phe Phe Gin Asn Pro 
35 40 ' 45 

Glu Leu Tyr lie Arg Glu Ser Val Lys Gly Ser Leu Asp Arg Lys Lys 
50 55 "* 60 

Leu Glu Gin Leu Tyr Asn Arg Tyr Lys Asp Pro Gin Asp Glu Asn Lys 
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65 70 75 80 



lie Gly lie Asp Gly lie Gin Gin Phe Cys Asp Asp Leu Ala Leu Asp 
85 90 95 



Pro Ala Ser He Ser Val Leu He He Ala Trp Lys Phe Arg Ala Ala 
100 105 110 



Thr Gin Cys Glu Phe Ser Lys Gin Glu Phe Met Asp Gly Met Thr Glu 
115 120 125 



Leu Gly Cys Asp Ser He Glu Gin Leu Lys Ala Gin He Pro Lys Met 
130 ~ 135 140 



Glu Gin Glu Leu Lys Glu Pro Gly Arg Phe Lys Asp Phe Tyr Gin Phe 
145 150 155 160 



Thr Phe Asn Phe Ala Lys Asn Pro Gly Gin Lys Gly Leu Asp Leu Glu 
165 170 175 



Met Ala He Ala Tyr Trp Asn Leu Val Leu Asn Gly Arg Phe Lys Phe 
180 185 190 



Leu Asp Leu Trp Asn Lys Phe Leu Leu Glu His His Lys Arg Ser He 
195 200 205 



Pro Lys Asp Thr Trp Asn Leu Leu Leu Asp Phe Ser Thr Met He Ala 
210 215 220 



Asp Asp Met Ser Asn Tyr Asp Glu Glu Gly Ala Trp Pro Val Leu He 
225 230 ' 235 240 



Asp Asp Phe Val Glu Phe Ala Arg Pro Gin He Ala Gly Thr Lys Ser 
245 250 255 



Thr Thr Val 



<210> 14 
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<211> 258 
<212> PRT 
<213> Homo sapiens 

<400> 14 

Ser Lys Gin Glu Phe Met Asp Gly Met Thr Glu Leu Gly Cys Asp Ser 
1 5 10 15 



lie Glu Gin Leu Lys Ala Gin lie Pro Lys Met Glu Gin Glu Leu Lys 
20 ^ 25 30 



Glu Pro Gly Arg Phe Lys Asp Phe Tyr Gin Phe Thr Phe Asn Phe Ala 
35 " 40 45 



Lys Asn Pro Gly Gin Lys Gly Leu Asp Leu Glu Asp Arg Lys Lys Leu 
50 55 60 



Glu Gin Leu Tyr Asn Arg Tyr Lys Asp Pro Gin Asp Glu Asn Lys lie 
65 70 75 80 



Gly lie Asp Gly lie Gin Gin Phe Cys Asp Asp Leu Ala Leu Asp Pro 
85 90 95 



Ala Ser He Ser Val Leu He He Ala Trp Lys Phe Arg Ala Ala Thr 
100 105 110 



Gin Cys Glu Phe Ser Lys Gin Glu Phe Met Asp Gly Met Thr Glu Leu 
115 120 125 



Gly Cys Asp Ser He Glu Gin Leu Lys Ala Gin He Pro Lys Met Glu 
130 135 " 140 



Gin Glu Leu Lys Glu Pro Gly Arg Phe Lys Asp Phe Tyr Gin Phe Thr 
145 150 155 160 



Phe Asn Phe Ala Lys Asn Pro Gly Gin Lys Gly Leu Asp Leu Glu Met 
165 170 175 



Ala He Ala Tyr Trp Asn Leu Val Leu Asn Gly Arg Phe Lys Phe Leu 
180 185 190 
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Asp Leu Trp Asn Lys Phe Leu Leu Glu His His Lys Arg Ser lie Pro 
195 ^ 200 " 205 



Lys Asp Thr Trp Asn Leu Leu Leu Asp Phe Ser Thr Met lie Ala Asp 
210 215 220 



Asp Met Ser Asn Tyr Asp Glu Glu Gly Ala Trp Pro Val Leu lie Asp 
225 230 235 240 



Asp Phe Val Glu Phe Ala Arg Pro Gin He Ala Gly Thr Lys Ser Thr 
245 • 250 255 



Thr Val 



<210> 15 

<211> 19 

<212> DNA 

<213> artificial sequence 
<220> 

<223> T7 polymerase promoter sequence (example 1) 

<400> 15 

taatacgact cactatagg 



<210> 16 

<211> 19 

<212> DNA 

<213> artificial sequence 
<220> 

<223> T3 polymerase promoter sequence 

<400> 16 

aattaaccct cactaaagg 



<210> 17 

<211> 19 

<212> DNA 

<213> artificial sequence 
<220> 
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<223> oligonucleotide for PCR amplification (example 3) 
<400> 17 

tcaatcagta tgtcgaccc 



<210> 18 

<211> 19 

<212> DNA 

<213> artificial sequence 



<220> 

<223> oligonucleotide for PCR amplification (example 3) 
<400> 18 

ggaagaaatt ggggaaaca 



<210> 19 

<211> 19 

<212> DNA 

<213> artificial 



sequence 



<220> 

<223> oligonucleotide for PCR amplification . (example 3) 
<400> 19 

atcgagcgcc tcttcaatc 



<210> 20 

<211> 19 

<212> DNA 

<213> artificial sequence 
<220> 

<223> oligonucleotide for PCR amplification (example 3) 

<400> 20 

tggtgtctcc atttgctga 19 



<210> 21 

<211> 19 

<212> DNA 

<213> artificial sequence 



<220> 

<223> oligonucleotide for PCR amplification (example 4) 



<400> 21 

atctgaagat ccgtccact 
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<210> 22 

<211> 19 

<212> DNA 

<213> artificial sequence 
<220> 

<223> oligonucleotide for PCR amplification (example 4) 



<400> 22 

atgcacaatg ggtattttt 



<210> 23 
<211> 23 
<212> DNA 

<213> artificial sequence 
<220> 

<223> oligonucleotide for PCR amplification (example 5; forward 
primer to generate dsRNA 305A12) 

<400> 23 

ttcgtctcga acacgtatat cct 



<210> 24 
<211> 23 
<212> DNA 

<213> artificial sequence 
<220> 

<223> oligonucleotide for PCR amplification (example 5; reverse 
primer to generate dsRNA 305A12) 

<400> 24 

gaaagaagat gaatcaggca ttg 



<210> 25 
<211> 23 
<212> DNA 

<213> artificial sequence 
<220> 

<223> oligonucleotide for PCR amplification (example 5; forward 
primer to generate dsRNA 341G5) 

<400> 25 

ctgcaaaaat tatgactgtg teg 
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<210> 26 
<211> 21 
<212> DNA 

<213> artificial sequence 
<220> 

<223> oligonucleotide for PCR amplification (example 5; reverse 
primer to generate dsRNA 341G5) 

<400> 26 

agcattcaga tttggttgtc c 



Seite 37 



